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HUMAN OXIDOREDUCTASE PROTEINS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of oxidoreductase proteins and 
5 to the use of these sequences in the diagnosis, treatment, and prevention of cell proliferative disorders 
including cancer; endocrine, metabolic, reproductive, neurological, viral, and autoimmune/inflammatory 
disorders, and in the assessment of the effects of exogenous compounds on the expression of nucleic 
acid and amino acid sequences of oxidoreductase proteins. 

10 BACKGROUND OF THE INVENTION 

Many pathways of biogenesis and biodegradation require oxidoreductase (dehydrogenase or 
reductase) activity, coupled to the reduction or oxidation of a donor or acceptor cofactor. Potential 
cofactors include cytochromes, oxygen, disulfide, iron-sulfur proteins, flavin adenine dinucleotide 
(FAD), and the nicotinamide adenine dinucleotides NAD and NADP (Newsholme, E.A. and Leech, 
15 A.R. (1983) Rinrh^istrv for tM M^ical Sciences . John Wiley and Sons, Chichester, U.K. pp. 779- 
793). 

Reductase activity catalyzes the transfer of electrons between substrate(s) and cofactor(s) with 
concurrent oxidation of the cofactor. The reverse dehydrogenase reaction catalyzes the reduction of a 
cofactor and consequent oxidation of the substrate. Oxidoreductase enzymes are a broad superf amily 
20 of proteins that catalyze numerous reactions in all cells of organisms ranging from bacteria to plants to 
humans. These reactions include metabolism of sugar, certain detoxification reactions in the liver, and 
the synthesis or degradation of fatty acids, amino acids, glucocorticoids, estrogens, androgens, and 
prostaglandins. Different family members are named according to the direction in which their reactions 
are typically catalyzed; thus they may be referred to as oxidoreductases, oxidases, reductases, or 
25 dehydrogenases. In addition, family members often have distinct cellular localizations, including the 
cytosol, the plasma membrane, mitochondrial inner or outer membrane, and peroxisomes. 

Tetrahydrofolate is a derivatized glutamate molecule that acts as a carrier, providing activated 
one-carbon units to a wide variety of biosynthetic reactions, including synthesis of purines, pyrimidines, 
and the amino acid methionine. Tetrahydrofolate is generated by the activity of a holoenzyme complex 
30 called tetrahydrofolate synthase, which includes three enzyme activities: tetrahydrofolate 
dehydrogenase, tetrahydrofolate cyclohydrolase, and tetrahydrofolate synthetase. Thus, 
tetrahydrofolate dehydrogenase plays an important role in generating building blocks for nucleic and 
amino acids, crucial to proliferating cells. 

Intracellular redox status plays a critical role in the assembly of proteins. A major rate limiting 
35 step in protein folding is the thiol.disulfide exchange necessary for correct protein assembly. Although 
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incubation of reduced, unfolded proteins in buffers containing defined ratios of oxidized and reduced 
thiols can lead to folding into native conformation, the rate of folding is slow, and the attainment of the 
native conformation decreases proportionately with protein size and the number of cysteine residues. 
Certain cellular compartments such as the endoplasmic reticulum of eukaryotes and the periplasm^ 
5 space of prokaryotes are maintained in a more oxidized state than the surrounding cytosol. Correct 
disulfide formation can occur in these compartments, but it occurs at a rate that is insufficient for 
normal cell processes and inadequate for synthesizing secreted proteins. 

Protein disulfide isomerases (PDIs), thioredoxins, and glutaredoxins are able to catalyze the 
formation of disulfide bonds and regulate the redox environment in cells to enable the necessary 
10 thiolrdisulfide exchanges. Each of these classes of molecules has a somewhat different function, but all 
r, belong to a group of disulfide-containing redox proteins that contain a conserved active-site sequence 
" ^iquitously distributed in eukaryotes and prokaryotes. PDIs are found in the endoplasmic - 

reticulum of eukaryotes and in the periplasmic space of prokaryotes. PDIs function by exchanging their 
own disulfide for thiols in a folding peptide chain. In contrast, reduced thioredoxins and glutaredoxins 
15 are generally found in the cytoplasm and function by directly reducing disulfides in the substrate 
proteins. Thioredoxin (Trx), a heat-stable, redox-active protein, contains an active site cysteine 
disulfide/dithiol. Oxidized thioredoxin, Trx-S, can be reduced to the dithiol form by NADPH and a 
specific flavoprotein enzyme, thioredoxin reductase Reduced thioredoxin, Trx-(SH), participates in a 
number of redox reactions mostly linked to reduction of protein disulfides. Trx and thioredoxin 
20 reductase (TR), together with NADPH, form a redox complex in which TR catalyzes the electron 

transport from NADPH to Trx. The reduced thioredoxin then functions as an electron donor in a wide 
variety of different metabolic processes. 

Disulfide-containing redox proteins not only facilitate disulfide formation, but also regulate and 
participate in a wide variety of physiological processes. The thioredoxin system serves, for example, as 
25 .,• a hydrogen donor for ribonucleotide reductase and controls the activity of enzymes by redox reactions. 
Mammalian thioredoxin (MT) acts as a hydrogen donor for ribonucleotide reductase and methionine 
sulfoxide reductase, facilitates refolding of disulfide-containing proteins, and activates the 
glucocorticoid and interleukin-2 receptors. MT also modulates the DNA binding activity of some 
, tt^Ptionfaaorsdmerdi^^ 
30 . nuclear factor Ref-1. The importance of me redox regulation of transcription factors is 
" V * fOS o*"*™ where a Point mutation of the thioredoxin-modulated cysteine residue results in 
constitutive activation of the AP-1 complex. Thioredoxin, secreted by cells using a leaderless pathway, 
stimulates the proliferation of lymphoid cells, fibroblasts, and a variety of human solid tumor cell lines. 
Furthermore, thioredoxin is an essential component of early pregnancy factor, inhibits human 
35 immunodeficiency virus expression in macrophages, reduces H 2 0 2> scavenges free radicals, and protects 
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cells against oxidative stress (Abate, C. et al. (1990) Science 249: 1 157-1 161 ; Rosen, A. eft al. (1995) 
Int Immunol. 7: 625-633; Tagaya, Y. et al. (1989) EMBO J. 8: 757-764; Newman, G. W. (1994) J. 
Expt. Med 180: 359-363; and Makino, Y. (1996) J. Clin. Invest. 98: 2469-2477). 

Short-chain alcohol dehydrogenases (SCADs) are a family of dehydrogenases that share only 
5 15% to 30% sequence identity, with similarity predominantly in the coenzyme binding domain and the 
substrate binding domain. In addition to the well-known role in detoxification of ethanol, SCADs are 
also involved in synthesis and degradation of fatty acids, steroids, arid some prostaglandins, and are 
therefore implicated in a variety of disorders such as lipid storage disease, myopathy, SCAD deficiency, 
and certain genetic disorders. For example, retinol dehydrogenase is a SCAD-family member (Simon, 
10 A.etal.(1995) J. Biol. Chem. 270:1 107^1 112) that converts retinol to retinal, the precursor of retinoic 
acid Retinoic acid, a regulator of differentiation and apoptosis. has been shown to down-regulate genes 
involved in cell proliferation and inflammation (Chai, X. et al. (1995) J. Biol. Chem. 270:3900-3904). 
In addition, retinol dehydrogenase has been linked to hereditary eye diseases such as autosomal 
recessive childhood-onset severe retinal dystrophy (Simon, A. et al. (1996) Genomics 36:424-430). 
15 Propagation of nerve impulses, modulation of cell proliferation and differentiation, induction of 

the immune response, and tissue homeostasis involve neurotransmitter metabolism (Weiss, B. (1991) 
Neurotoxicology 12:379-386; Collins, S.M. et al. (1992) Ann. N.Y. Acad. Sci. 664:415-424; Brown, 
J.K. and Imam, H. (1991) J. Inherit Metab. Dis. 14:436-458). Many pathways of neurotransmitter 
metabolism require oxidoreductase activity, coupled to reduction or oxidation of a cof actor, such as 
20 NAD7NADH (Newsholme, E.A. and Leech, A.R. (1983) Fiorhemfrtr y for the Medical Sciences , John 
Wiley and Sons, Chichester, U.K. pp. 779-793). Degradation of catecholamines (epinephrine or 
norepinephrine) requires alcohol dehydrogenase (in thebrain) or aldehyde dehydrogenase (in peripheral 
tissue). NAD^ -dependent aldehyde dehydrogenase oxidizes 5-hydroxyindole-3-acetate (the product of 
5-hydroxytryptamine (serotonin) metabolism) in the brain, blood platelets, liver and pulmonary 
25 endothelium (Newsholme, E.A. and Leech, A.R. supra p. 786). Other neurotransmitter degradation 
pathways that utilize NADVNADH-dependent oxidoreductase activity include those of L-DOPA 
(precursor of dopamine, a neuronal excitatory compound), glycine (an inhibitory neurotransmitter in the 
brain and spinal cord), histamine (liberated from mast cells during the inflammatory response), and 
taurine (an inhibitory neurotransmitter of thebrain stem, spinal cord and retina) (Newsholme, E.A. and 
30 Leech, A.R. supra, pp. 790, 792). Epigenetic or genetic defects in neurotransmitter metabolic pathways 
can result in a spectrum of disease states in different tissues including Parkinson's disease and inherited 
myoclonus (McCance, K.L. and Huether, S.E. (1994) Pathophysiology , Mosby-Year Book, Inc., St. 
Louis, MO pp. 402-404; Gundlach, AX. (1990) FASEB J. 4:2761-2766). 

3-Hydroxyacyl-CoA dehydrogenase (3HACD) is involved in fatty acid metabolism. It 
35 catalyzes the reduction of 3-hydroxyacyl-CoA to 3-oxoacyl-CoA, with concomitant oxidation of NAD 
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to NADH, in the mitochondria and peroxisomes of eukaryotic cells. In peroxisomes, 3HACD and 
enoyl-CoA hydratase form an enzyme complex called Afunctional enzyme, defects in which are 
associated with peroxisomal Afunctional enzyme deficiency. This interruption in fatty acid metabolism 
produces accumulation of very-long chain fatty acids, disrupting development of the brain, bone, and 
5 adrenal glands. Infants bom with this deficiency typically die within 6 months (Watkins, P. et al. 
<1989) J. Clin. Invest 83:771-777; Online Mendelian Inheritance in Man (OMIM), #261515). The 
neurodegeneration that is characteristic of Alzheimer's disease involves development of extracellular 
plaques in certain brain regions. A major protein component of these plaques is the peptide amyloid-P 
(Ap), which is one of several cleavage products of amyloid precursor protean (APP). 3HACD, which 

10 has been shown to bind the AP peptide, is ova-expressed in neurons affected in Alzheimer's disease. In 
^addition, an antibody against 3HACD can block the toxic effects of Ap in a cell culture model of 
^Alzheimer's disease (Yan, S. et al. (1997) Nature 389:689-695; OMIM, #602057). 

17P-hydroxysteroid dehydrogenase (1 7PHSD6) plays an important role in the regulation of the 
male reproductive hormone, dihydrotestosterone (DHTT); 1 7pHSD6 acts to reduce levels of DHTT by 

15 oxidizing a precursor of DHTT, 3a-diol, to androsterone which is readily glucuroni dated and removed 
from tissues. 17PHSD6 is active with both androgen and estrogen substrates when expressed in 
embryonic kidney 293 cells. At least five other isozymes of 17PHSD have been identified that catalyze 
oxidation and/or reduction reactions in various tissues with preferences for different steroid substrates 
(Biswas, M.G. and Russell, D.W. (1997) J. Biol. Chem. 272:15959-15966). For example, 17pHSDl 

20 preferentially reduces estradiol and is abundant in the ovary and placenta. 17PHSD2 catalyzes 
oxidation of androgens and is present in the endometrium and placenta. 17PHSD3 is exclusively a 
reductive enzyme in the testis (Geissler, W.M. et al. (1994) Nature Genet 7:34-39). An excess of 
androgens such as DHTT can contribute to certain disease states such as benign prostatic hyperplasia 
-and prostate cancer. 

25 >v Steroids, such as estrogen, testosterone, corticosterone, and others, are generated from a 

common precursor, cholesterol, and are interconverted into one another. A wide variety of enzymes act 
upon cholesterol, including a number of dehydrogenases. One such dehydrogenase is 3-oxo-5-a-steroid 
dehydrogenase (OASD), a microsomal membrane protein highly expressed in prostate and other 
-androgen-responsive tissues. OASD catalyzes the conversion of testosterone into dihydrotestosterone, 

30 which is the most potent androgen. Dihydrotestosterone is essential for the formation of the male 

-phenotype during embryogenesis, as well as for proper androgen-mediated growth of tissues such as the 
prostate and male genitalia. A defect in OASD that prevents the conversion of testosterone into 
dihydrotestosterone leads to a rare form of male pseudohennaphroditis, characterized by defective 
formation of the external genitalia (Andersson, S.,-et al. (1991) Nature 354:159-161 ; Labrie> F., et al. 

35 (1992) Endocrinology 1 31 :1571-1573; OMIM #264600). Thus, OASD plays a central role in sexual 
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differentiation and androgen physiology. 

The discovery of new oxidoreductase proteins and the polynucleotides encoding them satisfies a 
need in the art by providing new compositions which are useful in the diagnosis, prevention, and 
treatment of cell proliferative disorders including cancer; endocrine, metabolic, reproductive, 
5 neurological, viral, and autoimmune/inflammatory disorders, and in the assessment of the effects of 
exogenous compounds on the expression of nucleic acid and amino acid sequences of oxidoreductase 
proteins. 

SUMMARY OF THE INVENTION 

10 The invention features purified polypeptides, oxidoreductase proteins, referred to collectively as 

"ORP" and individually as "ORP-1," "ORP-2," "ORP-3," "ORP-4," "ORP-5," "ORP-6," "ORP-7," 
"ORP-8," "ORP-9," "ORP-IO," "ORP-11," "ORP-12," "ORP-13," "ORP-14," "ORP-15," "ORP- 
16," "ORP-17," "ORP-18," "ORP-19," "ORP-20," "ORP-21," "ORP-22," "ORP-23," "ORP-24," 
"ORP-25," "ORP-26," "ORP-27." In one aspect, the invention provides an isolated polypeptide 

15 comprising an amino acid sequence selected from the group consisting of a) an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-27, b) a naturally occurring amino acid sequence 
having at least 90% sequence identity to an amino acid sequence selected from the group consisting of 
SEQ ED NO:l-27, c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-27, and d) an immunogenic fragment of an amino acid sequence selected 

20 from the group consisting of SEQ ID NO:I-27. In one alternative, the invention provides an isolated 
polypeptide comprising the amino acid sequence of SEQ ID NO:l-27. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising an 
amino acid sequence selected from the group consisting of a) an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-27, b) a naturally occurring amino acid sequence having at least 

25 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
27, c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO:l-27, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1 -27. In one alternative, the polynucleotide encodes a polypeptide selected 
from the group consisting of SEQ ID NO:l-27. In another alternative, the polynucleotide is selected 

30 from the group consisting of SEQ ID NO:28-54. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-27, b) a naturally occurring amino acid sequence having at least 90% 

35 sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, c) 
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a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-27, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27. In one alternative, the invention provides a cell transformed with the 
recombinant polynucleotide. In another alto-native, the invention provides a transgenic organism 
5 comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide comprising an amino add 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:l-27, c) 

10 a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-27, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27. The method comprises a) culturing a ceil under conditions suitable for 
expression of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide 
comprising a promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) 

15 recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-27, b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 

20 consisting of SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO:l -27, and d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO:l r27. 

The invention further provides an isolated polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 

25 SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
an RNA equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 contiguous 
nucleotides. 

30 Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 

said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a 

35 polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
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an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a probe comprising 
at least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleotide 
in the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions 
whereby a hybridization complex is formed between said probe and said target polynucleotide or 
5 fragments thereof, and b) detecting the presence or absence of said hybridization complex, and 

optionally, if present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 
nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, said 
target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 

10 selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
an RNA equivalent of a)-d). The method comprises a) amplifying said target polynucleotide or 

15 fragment thereof using polymerase chain reaction amplification, and b) detecting the presence or 
absence of said amplified target polynucleotide or fragment thereof, and, optionally, if present, the 
amount thereof. 

The invention further provides a composition comprising an effective amount of a polypeptide 
comprising an amino acid sequence selected from the group consisting of a) an amino acid sequence 

20 selected from the group consisting of SEQ ID NO:l-27, b) a naturally occurring amino acid sequence 
having at least 90% sequence identity to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, and d) an immunogenic fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-27, and a pharmaceutical^ acceptable excipient In one 

25 embodiment, the composition comprises an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-27. The invention additionally provides a method of treating a disease or condition 
associated with decreased expression of functional ORP, comprising administering to a patient in need 
of such treatment the composition. 

The invention also provides a method for screening a compound for effectiveness as an 

30 agonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-27, and d) an immunogenic fragment of an amino 

35 acid sequence selected from the group consisting of SEQ ID NO:l-27. The method comprises a) 
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exposing a sample comprising the polypeptide to a compound, and b) detecting agonist activity in the 
sample. In one alternative, the invention provides a composition comprising an agonist compound 
identified by the method and a pharmaceutically acceptable excipient In another alternative, the 
invention provides a method of treating a disease or condition associated with decreased expression of 
functional ORP, comprising administering to a patient in need of such treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting 
; of a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-27, c) a biologically active fragment of an amino 
- acid sequence selected from the group consisting of SEQ ID NO: 1 -27, and d) an immunogenic fragment 
,?.of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27. The method 
comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
antagonist activity in the sample. In one alternative, the invention provides a composition comprising 
an antagonist compound identified by the method and a pharmaceutically acceptable excipient. In 
another alternative, the invention provides a method of treating a disease or condition associated with 
overexpression of functional ORP, comprising administering to a patient in need of such treatment the 
composition. 

The invention further provides a method of screening for a compound that specifically binds 
to a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-27, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-27, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-27, and d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-27. The method comprises a) combining 
. the polypeptide with at least one test compound under suitable conditions, and b) detecting binding 
of the polypeptide to the test compound, thereby identifying a compound that specifically binds to the 
polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1 -27, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-27, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27. The 
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method comprises a) combining the polypeptide with at least one test compound under conditions 
permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 
presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 
test compound with the activity of the polypeptide in the absence of the test compound, wherein a 

5 change in the activity of the polypeptide in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:28-54, the method comprising a) 

10 exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 

The invention further provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the. treated biological sample with a probe comprising at least 20 

15 contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:28-54, ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, iii) a 
polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii), 

20 and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, 
ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 

25 polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, iii) a 

polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii), 
and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide comprises a fragment of 
a polynucleotide sequence selected from the group consisting of i)-v) above; c) quantifying the 
amount of hybridization complex; and d) comparing the amount of hybridization complex in the 

30 treated biological sample with the amount of hybridization complex in an untreated biological 

sample, wherein a difference in the amount of hybridization complex in the treated biological sample 
is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

35 Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
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clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding ORP. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of ORP. 
5 ^ * Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression 
patterns of each nucleic acid sequence as determined by northern analysis; diseases, disorders, or 
-conditions associated with these tissues; and the vector into which each cDNA was cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding ORP were isolated. 
10 Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 

polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 

. r * DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
15 that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used heron is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
20 and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
25 as commonly understood by one of ordinary skill in the art to which this invention belongs. Although 
any machines, materials, and methods similar or equivalent to those described herein can be used to 
practice or test the present invention, the preferred machines, materials and methods are now described. 
All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are reported in the publications and which might be used in 
30 connection with the invention. Nothing herein is to be construed as an admission that the invention is 
not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"ORP" refers to the amino acid sequences of substantially purified ORP obtained from any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
35 human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant 
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The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
ORP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of ORP either by directly interacting with ORP 
or by acting on components of the biological pathway in which ORP participates. 
5 An "allelic variant" is an alto-native form of the gene encoding ORP. Allelic variants may 

result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
10 Each of these types of changes may occur alone, or in combination with the others, one or more times in 
a given sequence. 

"Altered" nucleic acid sequences encoding ORP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as ORP or a 
polypeptide with at least one functional characteristic of ORP. Included within this definition are 

15 polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding ORP, and improper or unexpected hybridization to allelic variants, with a 
locus other than the normal chromosomal locus for the polynucleotide sequence encoding ORP. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino 
acid residues which produce a silent change and result in a functionally equivalent ORP. Deliberate 

20 amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, 

hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological 
or immunological activity of ORP is retained. For example, negatively charged amino acids may 
include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and 
arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values may 

25 include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged side chains 
having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; 
and phenylalanine and tyrosine. 

The teams "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 

30 molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 

protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence 
to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known 

35 in the art 
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The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
ORP. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of ORP either by 
directly interacting with ORP or by acting on components of the biological pathway in which ORP 
5 participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, 
such as Fab, FCab')^ and Fv fragments, which are capable of binding an epitopic determinant 
" Antibodies that bind ORP polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 
10 to immunize an animal (e.g„ a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or 
synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers 

- that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole 
"limpet hemocyanin (KLH). The coupled peptide is thai used to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
15 makes contact with a particular antibody. When a protein or a fragment of a protein is used to 

immunize a host animal, numerous regions of the protein may induce the production of antibodies which 
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the 
protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to 
elicit the immune response) for binding to an antibody. 
20 The term "antisense" refers to any composition capable of base-pairing with the "sense" 

(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2 -methoxyethyl sugars or 2*-methoxyethoxy sugars; or oligonucleotides having 
25 modified bases such as 5-methyl cytosine, 2-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 

- molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 

' translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
30 - designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic ORP, or of any oligopeptide thereof, to 
induce a specific immune response in appropriate animals or cells and to bind with specific antibodies. 
35 "Complementary" describes the relationship between two singte-sttanded nucleic acid 
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sequences that anneal by base-pairing. For example, 5 -AGT-3' pairs with its complement, 
3'-TCA-5'. 

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 

5 amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding ORP or fragments of ORP may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 

10 SDS), and other components (e g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied Biosystems, 
Foster City CA) in the 5' and/or the 3' direction, and resequence*!, or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 

15 fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison WI) or Phrap 
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to 
produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 

20 protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which may be substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. 





Original Residue 


Conservative Substitution 




Ala 


Gly, Ser 


25 


Arg 


His, Lys 




Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 




Gin 


Asn, Glu, His 


30 


Glu 


Asp, Gin, His 




Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




De 


Leu, Val 




Leu 


He, Val 


35 


Lys 


Arg, Gin, Glu 




Met 


Leu, lie 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 


40 


Trp 


Phe, Tyr 
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Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
5 backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

10 The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical 

modifications of a polynucleotide sequence can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological function of the natural molecule. A derivative polypeptide is 
one modified by glycosylation, pegylation, or any similar process that retains at least one biological or 

1 5 immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

A "fragment" is a unique portion of ORP or the polynucleotide encoding ORP which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 

20 to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 
15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 

25 example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

30 A fragment of SEQ ID NO:28-54 comprises a region of unique polynucleotide sequence that 

specifically identifies SEQ ID NO:28-54, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:28-54 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish " 
SEQ ID NO:28-54 from related polynucleotide sequences. The precise length of a fragment of SEQ 

35 ID NO:28-54 and the region of SEQ ID NO:2E-54 to which the fragment corresponds are -routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment 
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A fragment of SEQ ID NO:l-27 is encoded by a fragment of SEQ ID NO:28-54. A fragment 
of SEQ ID NO: 1-27 comprises a region of unique amino acid sequence that specifically identifies 
SEQ ID NO:l-27. For example, a fragment of SEQ ID NO:l-27 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO: 1-27. The precise length of 
5 a fragment of SEQ ID NO:l-27 and the region of SEQ ID NO:l-27 to which the fragment 
corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
purpose for the fragment. 

A "full-length" polynucleotide sequence is one containing at least a translation initiation codon 
(e.g., methionine) followed by an open reading frame and a translation termination codoa A "full- 
10 length" polynucleotide sequence encodes a "full-length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to 
the percentage of residue matches between at least two polynucleotide sequences aligned using a 
15 standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Pa-cent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e sequence 
20 alignment program. This program is part of the LASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
andRM. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. 
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 
Ktuple=2, gap penalty^?, window=4, and "diagonals saved"=4. The "weighted" residue weight table is 
25 selected as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between 
aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search 
Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several 
30 sources, including the NCBI, Bethesda, MD, and on the Internet at 

http://ww.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
35 Sequences" can be accessed and used interactively at http7/www.ncbi.nlm.nih.gov/gorm>12.html. The 
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"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-2 1-2000) set at default parameters. Such default parameters may be, for example: 
5 Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 
10 Expect: JO 

Word Size: J J 

Filter: on 

Percent identity may be measured ova: the length of an entire defined sequence, for example, as 
defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 
15 the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 1 00, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured. * 
20 Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 

similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 
a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
25 the percentage of residue matches between at least two polypeptide sequences aligned using a 

standardized algorithm. Methods of polypeptide sequence alignment are well-known Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the charge and hydrophobicity at the site of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 
30 Pa-cent identity between polypeptide sequences may be determined using the default parameters 

of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3.12e sequence alignment 
program (described and referenced above). For pairwise alignments of polypeptide sequences using 
CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap penalty=3, window=5, and 
"diagonals saved' =5. The PAM250 matrix is selected as the default residue weight table. As with 
35 polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" 
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between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.12 
(Apr-21 -2000) with blastp set at default parameters. Such default parameters may be, for example: 
5 Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 

Gap x drop-off: 50 

Expect: 10 

Word Size: 3 
10 Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
15 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
20 chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
25 complementary strand through base pairing under defined hybridization conditions. Specific 

hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the stringency 
of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e., 
30 binding between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for 
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1% (w/v) 
35 SDS, and about 100 ^ig/ml sheared, denatured salmon sperm DNA. 
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Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
undo- which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C Iowa- than the thermal melting point (TJ for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of the 
5 target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions 
for nucleic acid hybridization are well known and can be found in Sambrook, J. et al., 1989, Molecular 
Cloning: A Laboratory Manual. 2 nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 
10 include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0. 1 % SDS, for 1 houn 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 fig/ml. Organic solvent, such as 
15 formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 
20 The term "hybridization complex" refers to a complex formed between two nucleic acid 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization 
complex may be formed in solution (e.g., C 0 t or Rot analysis) or formed between one nucleic acid 
sequence present in solution and another nucleic acid sequence immobilized on a solid support <e.g„ 
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells 
25 or their nucleic acids have been fixed). 

- The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of 
30 various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular 
and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of ORP which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of 
35 ORP which is useful in any of the antibody production methods disclosed herein or known in the art. 
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The term "microarray" refers to an arrangement of a plurality of polynucleotides, polypeptides, 
or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 
5 The term "modulate" refers to a change in the activity of ORP. For example, modulation may 

cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
functional, or immunological properties of ORP. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
10 synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
15 sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 
20 preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an ORP may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by 
25 cell type depending on the enzymatic milieu of ORP. 

"Probe" refers to nucleic acid sequences encoding ORP, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical 
labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 'Primers" are 
30 short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base^pairing. The primer may then be extended along the target DNA strand by a DNA 
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid 
sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
35 nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
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be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 
5 Methods for preparing and using probes and primers are described in the references, for 

^example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular 

» Biology , Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR 
Protocols. A Guide to Methods and Applications , Academic Press, San Diego CA. PCR primer pairs 
10 can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 

15 100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
prima- selection program (available to the public from the Genome Center at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 

20 and is thus useful for designing primers on a genome- wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Cento* for Genome Research, 
Cambridge MA) allows the user to input a "misprinting library," in which sequences to avoid as primer 
binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for 
microarrays. (The source code for the latter two primer selection programs may also be obtained from 

25 their respective sources and modified to meet the user's specific needs.) The PrimeGen program 

(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 
^ Hence, this program is useful for identification of both unique and conserved oligonucleotides and 

30 polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 
those described above. 

35 A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
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that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, suj^a. The term recombinant includes nucleic acids that have 
5 been altered solely by addition, substitution, or deletion of a portion of the nucleic acid Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence 
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
10 expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5 f and 3' untranslated regions (UTRs). 
Regulatory elements interact with host or viral proteins which control transcription, translation, or RNA 
stability. 

15 "Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 

amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 

20 sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding ORP, or fragments thereof, or ORP itself, may comprise a bodily fluid; an extract from 

25 a cell, chromosome, organdie, or membrane isolated from a cell; a ceil; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular structure 

30 of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 

example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope 
A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 
reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

35 removed from their natural environment and are isolated or separated, and are at least 60% free, 
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preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides by 
different amino acid residues or nucleotides, respectively. 
5 "Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 

^ chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
: trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refers to the collective pattern of gene expression by a particular cell type 
10 or tissue under given conditions at a given time. 

'Transformation" describes a process by which exogenous DN A is introduced into a recipient 
rcelL Transformation may occur under natural or artificial conditions according to various methods well 
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences 
into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type 
15 - of host cell being transformed and may include, but is not limited to, bacteriophage or viral infection, 
electroporation, heat shock, lipofection, and particle bombardment. The term 'transformed" cells 
includes stably transformed cells in which the inserted DNA is capable of replication either as an 
autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed 
cells which express the inserted DNA or RNA few limited periods of time, 
20 A "transgenic organism," as used herein, is any organism, including but not limited to 

animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 
- of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
25 v a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
: vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, plants, and animals. The isolated DNA of the present invention can be 
^ introduced into the host by methods known in the art, for example infection, transfection, 
30 transformation or transconjugation. Techniques for transferring the DNA of the present invention 
^ into such organisms are widely known and provided in references such as Sambrook, J. et al. (19B9), 
supra , 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the 
35 nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 <May-07- 1999) 
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set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an "allelic" 
(as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant 

5 identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides 
due to alternative splicing of exons during mRNA processing. The corresponding polypeptide may 
possess additional functional domains or lack domains that are present in the reference molecule. 
Species variants are polynucleotide sequences that vary from one species to another. The resulting 
polypeptides generally will have significant amino acid identity relative to each other. A polymorphic 

10 variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given 
species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in 
which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be 
indicative of, for example, a certain population, a disease state, or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 

15 least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

20 THE INVENTION 

The invention is based on the discovery of new human oxidoreductase proteins (ORP), the 
polynucleotides encoding ORP, and the use of these compositions for the diagnosis, treatment, or 
prevention of cell proliferative disorders including cancer; endocrine, metabolic, reproductive, 
neurological, viral, and autoimmune/inflammatory disorders. 

25 Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding ORP. 

Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide and 
nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which nucleic 
acids encoding each ORP were identified, and column 4 shows the cDNA libraries from which these 
clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA libraries. Clones 

30 for which cDNA libraries are not indicated were derived from pooled cDNA libraries. In some cases, 
GenBank sequence identifiers are also shown in column 5. The Incyte clones and GenBank cDN A 
sequences, where indicated, in column 5 were used to assemble the consensus nucleotide sequence of 
each ORP and are useful as fragments in hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the invention: 

35 column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
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polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical methods 
and in some cases, searchable databases to which the analytical methods were applied The methods of 
5 column 7 were used to characterize each polypeptide through sequence homology and protean motifs. 
~* The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 

* associated with nucleotide sequences encoding ORP. The first column of Table 3 lists the nucleotide 
¥ -SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1. These fragments are 
/"useful, for example, in hybridization or amplification technologies to identify SEQ ID NO:28-54 and 

10 * to distinguish between SEQ ID NO:28-54 and related polynucleotide sequences. The polypeptides 
encoded by these fragments are useful, for example, as immunogenic peptides. Column 3 lists tissue 
^ categories which express ORP as a fraction of total tissues expressing ORP. Column 4 lists diseases, 
disorders, or conditions associated with those tissues expressing ORP as a fraction of total tissues 
expressing ORP. Column 5 lists the vectors used to subclone each cDNA library. 

15 The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 

from which cDN A clones encoding ORP were isolated. Column 1 references the nucleotide SEQ ID 
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows 
the tissue origins and .other descriptive information relevant to the cDNA libraries in column 2. 

SEQ ID NO:31 maps to chromosome 3 within the interval from 23.2 to 31.4 centiMorgans. 

20 SEQ ID NO:42 maps to chromosome 22 within the interval from 0 to 40.2 centiMorgans. SEQ ID 
NO:48 maps to chromosome 7 within the interval from 100.5 to 1 14.5 centiMorgans, to chromosome 
17 within the interval from 67.6 to 69.3 centiMorgans, and to chromosome 17 within the interval 
from 83.8 centiMorgans to the q terminus. SEQ ID NO:53 maps to chromosome 1 1 within the 
interval from 64.9 to 70.9 centiMorgans. 

25 The invention also encompasses ORP Variants. A preferred ORP variant is one which has at 

* least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence 
identity to the ORP amino acid sequence, and which contains at least one functional or structural 
characteristic of ORP. 

The invention also encompasses polynucleotides which encode ORP. In a particular 
30 ~ embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from 
. the group consisting of SEQ ID NO:28-54, which encodes ORP. The polynucleotide sequences of SEQ 
ID NO:28-54, as presented in the Sequence Listing, embrace the equivalent RNA sequences, wherein 
occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is 
composed of ribose instead of deoxyribose. 
35 The invention also encompasses a variant of a polynucleotide sequence -encoding ORP. In 
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particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least 
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding ORP. A particular aspect of the invention encompasses a variant of a polynucleotide 
sequence comprising a sequence selected from the group consisting of SEQ ID NO:28-54 which has at 
5 least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide 

sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO:28-54. 
Any one of the polynucleotide variants described above can encode an amino acid sequence which 
contains at least one functional or structural characteristic of ORP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 

10 code, a multitude of polynucleotide sequences encoding ORP, some bearing minimal similarity to the 
polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the 
invention contemplates each and every possible variation of polynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations are made in 
accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally 

15 occurring ORP, and all such variations are to be considered as being specifically disclosed. 

Although nucleotide sequences which encode ORP and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring ORP under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding ORP or its 
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring 

20 codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a 

particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons 
are utilized by the host. Other reasons for substantially altering the nucleotide sequence encoding ORP 
and its derivatives without altering the encoded amino acid sequences include the production of RNA 
transcripts having more desirable properties, such as a greater half-life, than transcripts produced from 

25 the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode ORP and ORP 
derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the synthetic 
sequence may be inserted into any of the many available expression vectors and cell systems using 
reagents well known in the art. Moreover, synthetic chemistry may be used to introduce mutations into 

30 a sequence encoding ORP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:28-54 and fragments thereof under various conditions of stringency. (See, e.g., WaW, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 

35 152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in 
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"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of the 
embodiments of the inventioa The methods may employ such enzymes as the Klenow fragment of 
DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
5 Biosystems, Foster City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway 
NJ), or combinations of polymerases and proofreading exonucleases such as those found in the 
- ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence 
preparation is automated with machines such as the MICROLAB 2200 liquid transfer system 
(Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 
10 800 thermal cycler (Applied Biosystems). Sequencing is then carried out using either the ABI 373 or 
377 DNA sequencing system (Applied Biosystems), the MEG AB ACE 1 000 DNA sequencing system 
-(Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting sequences are 
analyzed using a variety of algorithms which are well known in the art (See, e.g., Ausubd, F.M. 
(1997) Short Protocols in Molecular Biology , John Wiley & Sons, New York NY, unit 7.7; Meyers, 
15 R. A. (1 995) Molecular Biology and Biotechnology . Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding ORP may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory dements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
20 DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:31 8-322.) 

Another method, inverse PCR, uses primers that extend in diva-gent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising a 
known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
25 to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. - 
^(1991) PCR Methods Applic. 1:111-1 19.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 
^are known in the art (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3053-3060). 
30 Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo 
*AJto C A) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, 
Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a 
35 GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 
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72°C. 

When screening for full-length cDN As, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library 

5 does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5* 
non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze the 
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 

10 specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g„ GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 

15 which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode ORP may be cloned in recombinant DNA molecules that direct expression of ORP, or fragments . 
or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of the 
genetic code, other DNA sequences which encode substantially the same or a functionally equivalent 

20 amino acid sequence may be produced and used to express ORP. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alto* ORP-encoding sequences for a variety of purposes including, but not 
limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 

25 oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 

mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, 
alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARB REEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 

30 5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
improve the biological properties of ORP, such as its biological or enzymatic activity or its ability to 
bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 

35 subjected to selection or screening procedures that identify those gene variants with the desired 
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properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
5 optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 

r T maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 

<- manner. 

In another embodiment, sequences encoding ORP may be synthesized, in whole or in part, 
10 , using chemical methods well known in the art. (See, e.g., Canithers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
, x ORP itself or a fragment thereof may be synthesized using chemical methods. For example, peptide 
, synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 
Creighton, T. (1984) Proteins. Structures and Molecular Properties . WH Freeman, New York NY, pp. 
15 55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved 
using the ABI 431 A peptide synthesizer (Applied Biosystems). Additionally, the amino acid sequence 
of ORP, or any part thereof, may be altered during direct synthesis and/or combined with sequences 
from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a 
sequence of a naturally occurring polypeptide. 
20 The peptide may be substantially purified by preparative high performance liquid 

chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, supra , pp. 28-53.) 
, In order to express a biologically active ORP, the nucleotide sequences encoding ORP or 

25 derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
s the necessary elements for transcriptional and translational control of the inserted coding sequence in a 
suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5* and 3* untranslated regions in the vector and in polynucleotide sequences 
v encoding ORP. Such elements may vary in their strength and specificity. Specific initiation signals 
30 ? may also be used to achieve more efficient translation of sequences encoding ORP. Such signals 
... include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 
sequences encoding ORP and its initiation codon and upstream regulatory sequences are inserted into 
the appropriate expression vector, no additional transcriptional or translational control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
35 translational control signals including an in-frame ATG initiation codon should be provided by the 
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vector. Exogenous translational elements and initiation codons may be of various origins, both natural 
and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate 
for the particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 

5 Methods which are well known to those skilled in the art may be used to construct expression 

vectors containing sequences encoding ORP and appropriate transcriptional and translational control 
elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in 
vivo genetic recombinatioa (See, e.g., Sambrook, J. et al. (1989) Molecular Cloninp, A Laboratory 
Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) 

10 Current Protocols in Molecular Biology . John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding ORP. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 

15 plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g„ Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g., Sambrook, supra ; Ausubel, supra ; Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; 
Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Natl. 

20 Acad. Sci. USA 91 :3224-3227; Sandig, V. et al. (1996) Hum. Gene Then 7:1937-1945; Takamatsu, 
N. (1987) EMBO J. 6:307-311; Comzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. 
(1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The 
McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81 :3655-3659; and Harrington, 

25 J.J. et al. (1997) Nat Genet 15:345-355.) Expression vectors derived from retroviruses, 

adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di 
Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. Natl. Acad. Sci. 
USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. 

30 (1994) Mol. Immunol. 31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) 
The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding ORP. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding ORP can be achieved using a 

35 multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid 
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(Life Technologies). Ligation of sequences encoding ORP into the vector's multiple cloning site 
disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro 
transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested 

5 deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. BioL Chem. 
264:5503-5509.) When large quantities of ORP are needed, e.g. for the production of antibodies, 
vectors which direct high level expression of ORP may be used. For example, vectors containing the 
strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of ORP. A number of vectors containing 

10 constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH promoters, may be 
used in the yeast Saccharom vces cerevisiae or Pichia pastoris . In addition, such vectors direct either the 
secretion or intracellular retention of expressed proteins and enable integration of foreign sequences into 
the host genome for stable propagatioa (See, e.g., Ausubel, 1995, supra : Bitter; supra : and Scorer, 
supra .) 

5 Plant systems may also be used for expression of ORP. Transcription of sequences encoding 

ORP may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or in 
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-31 1). 
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 
used. (See, e.g., Coruzzi, supra : Broglie, supra: and Winter, supra .) These constructs can be 
introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, eg., 
The McGraw Hill Year book of Science and Technology (1 992) McGraw Hill, New York NY, pp. 
191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding ORP may be ligated into an 
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses ORP in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. Natl. 
Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma virus 
(RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV-based 
vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods Oiposomes, polycationic amino polymers, 
or vesicles) for therapeutic purposes. (See, e.g„ Harrington, J.J. et al. (1997) Nat Genet. 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression of 
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ORP in cell lines is preferred. For example, sequences encoding ORP can be transformed into cell lines 
using expression vectors which may contain viral origins of replication and/or endogenous expression 
elements and a selectable marker gene on the same or on a separate vector. Following the introduction 
of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before being 
5 switched to selective media. The purpose of the selectable marker is to confer resistance to a selective 
agent, and its presence allows growth and recovery of cells which successfully express the introduced 
sequences. Resistant clones of stably transformed cells may be propagated using tissue culture 

techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These include, 

10 but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphcribosyltransferase 
genes, for use in tkr and apr cells, respectively. (See, e.g., Wigler, M. et al. (1977) Cell 1 1:223-232; 
Lowy, I, et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to 
15 chlorsulfuronandphosphinotricm^ (See, e.g., Wigler, M. et al. (1980) 

Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) 
Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements 
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), B 
20 glucuronidase and its substrate B-glucuronide, or luciferase and its substrate luciferin may be used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A. 
(1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
25 also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding ORP is inserted within a marker gene sequence, transformed cells containing 
sequences encoding ORP can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding ORP under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 
30 expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding ORP and that express 
ORP may be identified by a variety of procedures known to those of skill in the art. These procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridizatioas, PCR amplification, and 
protein bioassay or immunoassay techniques which include membrane, solution, or chip based 
35 technologies for the detection and/or quantification of nucleic acid or protean sequences. 
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Immunological methods for detecting and measuring the expression of ORP using either 
specific polyclonal or monoclonal antibodies are known in the art Examples of such techniques include 
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs) t and fluorescence 
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
5 antibodies reactive to two non-interfering epitopes on ORP is preferred; but a competitive binding assay 
- may be employed. These and other assays are well known in the art (See, e.g., Hampton, R. et al. 
— (1990) Serological Methods, a Laboratory Manual . APS Press, St Paul MN, Sect IV; Coligan, J.E. et 
al. (1997) Current Protocols in Immunology * Greene Pub. Associates and Wiley-Interscience, New 
1 York NY; and Pound, J.D. (1998) Immunochemical Protocols . Humana Press, Totowa NJ.) 
10 A wide variety of labels and conjugation techniques are known by those skilled in the art and 

may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
*. or PCR probes for detecting sequences related to polynucleotides encoding ORP include oligolabeling, 
• ?i> nick translation, end-labeling, or PGR amplification using a labeled nucleotide. Alternatively, the 
sequences encoding ORP, or any fragments thereof, may be cloned into a vector for the production of 
15 an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to 
synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of commercially available 
kits, such as those provided by Amersham Pharmacia Biotech, Promega (Madison WI), and US 
Biochemical. Suitable reporter molecules or labels which may be used for ease of detection include 
20 radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, 
cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding ORP may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
>< produced by a transformed cell may be secreted or retained intracellular^ depending on the sequence 
25 -s and/or the vector used. As will be understood by those of skill in the art, expression vectors containing 
" polynucleotides which encode ORP may be designed to contain signal sequences which direct secretion 
of ORP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
* inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the 
30 polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different host Gells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 
(e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture 
35 Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing 
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of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding ORP may be ligated to a heterologous sequence resulting in translation of a fusion 
protein in any of the aforementioned host systems. For example, a chimeric ORP protein containing a 
5 heterologous moiety that can be recognized by a commercially available antibody may facilitate the 
screening of peptide libraries for inhibitors of ORP activity. Heterologous protein and peptide moieties 
may also facilitate purification of fusion proteins using commercially available affinity matrices. Such 
moieties include, but are not limited to, glutathione S-transferase (GST), maltose binding protein 
(MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin 
10 (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion proteins on 
immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, 
respectively. FLAG, c-wyc, and hemagglutinin (HA) enable immunoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 
15 located between the ORP encoding sequence and the heterologous protein sequence, so that ORP may 
be cleaved away from the heterologous moiety following purification. Methods for fusion protein 
expression and purification are discussed in Ausubel (1995, supra , ch. 10). A variety of commercially 
available kits may also be used to facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled ORP may be achieved in 
20 vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems 
couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or 
SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 
example, 35 S-methionine. 

ORP of the present invention or fragments thereof may be used to screen for compounds that 
25 specifically bind to ORP. At least one and up to a plurality of test compounds may be screened for 
specific binding to ORP. Examples of test compounds include antibodies, oligonucleotides, proteins 
(e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
ORP, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
30 natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which ORP 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express ORP, either as a secreted 
35 protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila, or E. 
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coli . Cells expressing ORP or cell membrane fractions which contain ORP are then contacted with a 
test compound and binding, stimulation, or inhibition of activity of either ORP or the compound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
5 detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
^ the assay may comprise the steps of combining at least one test compound with ORP, either in 
solution or affixed to a solid support, and detecting the binding of ORP to the compound, 
.Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cellrfree preparations, chemical 
10 libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. 

> ORP of the present invention or fragments thereof may be used to screen for compounds that 

- modulate the activity of ORP. Such compounds may include agonists, antagonists, or partial or 
inverse agonists. In one embodiment, an assay is performed under conditions permissive for ORP 

15 activity, wherein ORP is combined with at least one test compound, and the activity of ORP in the 
presence of a test compound is compared with the activity of ORP in the absence of the test 
compound. A change in the activity of ORP in the presence of the test compound is indicative of a 
compound that modulates the activity of ORP. Alternatively, a test compound is combined with an in 
vitro or cell-free system comprising ORP under conditions suitable for ORP activity, and the assay is 

20 pa-formed. In either of these assays, a test compound which modulates the activity of ORP may do 
so indirectly and need not come in direct contact with the test compound. At least one and up to a 
plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding ORP or their mammalian homologs may 
be "knocked out" in an animal model system using homologous recombination in embryonic stem 

25 (ES) cells. Such techniques are well known in the art and are useful for the generation of animal 

models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No, 5,767,337.) For 
example, mouse ES cells, such as the mouse 1 29/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 
interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi,M.R. 

30 (1989) Science 244: 1 288-1 292). The vector integrates into the corresponding region of the host 

genome by homologous recombination. Alternatively, homologous recombination takes place using 
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 
manner (Marth, J.D. (1996) Clin. Invest 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 
Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 

35 blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 
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to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 

Polynucleotides encoding ORP may also be manipulated in vitro in ES cells derived from 
5 human blastocysts. Human ES cells have the potential to differentiate into at least eight separate ceU 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282:1 145-1 147). 

Polynucleotides encoding ORP can also be used to create "knockin" humanized animals 

10 (pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology* a 
region of a polynucleotide encoding ORP is injected into animal ES cells, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the 
blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 

15 Alternatively, a mammal inbred to overexpress ORP, e.g., by secreting ORP in its milk, may also 
serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of ORP and oxidoreductase proteins. In addition, the expression of ORP is closely 

20 associated with cancerous, inflamed, reproductive, and gastrointestinal tissues. Therefore, ORP 
appears to play a role in cell proliferative disorders including cancer; endocrine, metabolic, 
reproductive, neurological, viral, and autoimmune/inflammatory disorders. In the treatment of 
disorders associated with increased ORP expression or activity, it is desirable to decrease the 
expression or activity of ORP. In the treatment of disorders associated with decreased ORP 

25 expression or activity, it is desirable to increase the expression or activity of ORP. 

Therefore, in one embodiment, ORP or a fragment or derivative thereof may be administered 
to a subject to treat or prevent a disorder associated with decreased expression or activity of ORP. 
Examples of such disorders include, but are not limited to, a cell proliferative disorder, such as actinic 
keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue 

30 disease, myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 
thrombocythemia; a cancer, such as adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, 
sarcoma, teratocarci noma, and, in particular, cancers of the adrenal gland, bladder, bone, bone 
marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, 
muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, 

35 thyroid, and uterus; an endocrine disorder, such as disorders of the hypothalamus and pituitary 
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resulting from lesions such as primary brain tumors, adenomas, infarction associated with pregnancy, 
hypophysectomy, aneurysms, vascular malformations, thrombosis, infections, immunological 
disorders, and complications due to head trauma, disorders associated with hypopituitarism including 
hypogonadism, Sheehan syndrome, diabetes insipidus, Kallman's disease, Hand-Schuller-Christian 
5 disease, Letterer-Siwe disease, sarcoidosis, empty sella syndrome, and dwarfism, disorders associated 
^ with hyperpituitarism including acromegaly, giantism, and syndrome of inappropriate antidiuretic 

- hormone (ADH) secretion (SIADH) often caused by benign adenoma, disorders associated with 
hypothyroidism including goiter, myxedema, acute thyroiditis associated with bacterial infection, 
subacute thyroiditis associated with viral infection, autoimmune thyroiditis (Hashimoto's disease), 

10 - and cretinism, disorders associated with hyperthyroidism including thyrotoxicosis and its various 
-forms, Grave's disease, pretibial myxedema, toxic multinodular goiter, thyroid carcinoma, and 
-Plummer's disease, disorders associated with hyperparathyroidism including Conn disease (chronic 
~~ hypercalemia), pancreatic disorders such as Type I or Type II diabetes mellitus and associated 
complications, disorders associated with the adrenals such as hyperplasia* carcinoma, or adenoma of 

15 the adrenal cortex, hypertension associated with alkalosis, amyloidosis, hypokalemia, Cushing's 
disease, Liddle's syndrome, and Arnold-Healy-Gordon syndrome, pheochromocytoma tumors, and 
Addison's disease, disorders associated with gonadal steroid hormones such as: in women, abnormal 
prolactin production, infertility, endometriosis, perturbations of the menstrual cycle, polycystic 
ovarian disease, hyperprolactinemia, isolated gonadotropin deficiency, amenorrhea, galactorrhea, 

20 . hermaphroditism, hirsutism and virilization, breast cancer, and, in postmenopausal women, 

osteoporosis, and, in men, Leydig cell deficiency, male climacteric phase, and germinal cell aplasia, 
hypergonadal disorders associated with Leydig cell tumors, androgen resistance associated with 
absence of androgen receptors, syndrome of 5 a-reductase, and gynecomastia; a metabolic disorder, 
such as Addison's disease, cystic fibrosis, diabetes, fatty hepatocirrhosis, galactosemia, goiter, 

25 ; hyperadrenalism, hypoadrenalism, hyperparathyroidism, hypoparathyroidism, hypercholesterolemia, 
r hyperthyroidism, hypothyroidism hyperlipidemia, hyperlipemia, lipid myopathies, obesity, 

- lipodystrophies, and phenylketonuria, congenital adrenal hyperplasia, pseudovitamin D-deficiency 
rickets, cerebrotendinous xanthomatosis, and coumarin resistance; a reproductive disorder, such as 

~ disorders of prolactin production, infertility, including tubal disease, ovulatory defects, and 
30 - endometriosis, disruptions of the estrous cycle, disruptions of the menstrual cycle, polycystic ovary 
syndrome, ovarian hyperstimulation syndrome, endometrial and ovarian tumors, uterine fibroids, 
autoimmune disorders, ectopic pregnancies, and teratogenesis; cancer of the breast, fibrocystic breast 
disease, and galactorrhea, disruptions of spermatogenesis, abnormal sperm physiology, cancer of the 
testis, cancer of the prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, 
35 carcinoma of the male breast, and gynecomastia; a neurological disorder, such as epilepsy, ischemic 
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cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic 
lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis 
pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral 

5 meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 

thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases 
including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal 
familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 
tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 

10 retardation and other developmental disorders of the central nervous system/cerebral palsy, 

neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia 
gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, 

15 seasonal affective disorder (SAD), akathesia, amnesia, catatonia* diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, and Tourette's disorder; an 
autoimmune/inflammatory disorder, such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, 
anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 

20 autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, 
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes 
mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, 
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' 
disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 

25 myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, 
polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, 
systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, 
ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and 

30 trauma; and a viral disorder, such as viral infections, e.g., those caused by adenoviruses (acute 
respiratory disease, pneumonia), arenaviruses (lymphocytic choriomeningitis), bunyaviruses 
(Hantavirus), coronaviruses (pneumonia, chronic bronchitis), hepadnaviruses (hepatitis), 
herpesviruses (herpes simplex virus, varicella-zoster virus, Epstein-Barr virus, cytomegalovirus), 
flaviviruses (yellow fever), orthomyxoviruses (influenza), papillomaviruses (cancer), 

35 paramyxoviruses (measles, mumps), picornoviruses (rhinovirus, poliovirus, coxsackie-virus), 
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polyomaviruses (BK vims, JC virus), poxviruses (smallpox), reovinis (Colorado tick fever), 
retroviruses (human immunodeficiency virus, human Tlymphotropic virus), rhabdovi.ruses (rabies), 
rotaviruses (gastroenteritis), and togaviruses (encephalitis, rubella). 

In another embodiment, a vector capable of expressing ORP or a fragment or derivative thereof 
5 may be administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of ORP including, but not limited to, those described above. 

In a further, embodiment, a composition comprising a substantially purified ORP in conjunction 
^ with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent a disorder 
associated with decreased expression or activity of ORP including, but not limited to, those provided 
10 - above. 

^ In still another embodiment, an agonist which modulates the activity of ORP may be 

administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
- of ORP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of ORP may be administered to a subject to treat or 
15 prevent a disorder associated with increased, expression or activity of ORP. Examples of such 

disorders include, but are not limited to, those cell proliferative disorders including cancer; endocrine, 
metabolic, reproductive, neurological, viral, and autoimmune/inflammatory disorders described above. 
In one aspect, an antibody which specifically binds ORP may be used directly as an antagonist or 
indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissues 
20 which express ORP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding ORP may be administered to a subject to treat or prevent a disorder associated with increased 
expression or activity of ORP including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
25 sequences, or vectors of the invention may be administered in combination with other appropriate 

therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination 
of therapeutic agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with 
30 lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of ORP may be produced using methods which are generally known in the art. 
In particular, purified ORP may be used to produce antibodies or to screen libraries of pharmaceutical 
agents to identify those which specifically bind ORP. Antibodies to ORP may also be generated using 
methods that are well known in the art Such antibodies may include, but are not limited to, polyclonal, 
35 monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced by a Fab 
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expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are generally 
preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with ORP or with any fragment or oligopeptide thereof 
5 which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 
(bacilli Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 

10 It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to ORP 

have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at 
least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are 
identical to a portion of the amino acid sequence of the natural protein. Short stretches of ORP amino 
acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule 

15 may be produced. 

Monoclonal antibodies to ORP may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not limited 
to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., KoMer, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 

20 Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, SJL. et al. (1984) Proc. 

25 Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce ORP-specific single chain 
antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be generated 
by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g„ Burton, D.R-. 

30 (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299.) 

35 Antibody fragments which contain specific binding sites for ORP may also be generated. For 
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example, such fragments include, but are not limited to, FCab 1 ^ fragments produced by pepsin digestion 
of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the F<ab , )2 
fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al. 
5 (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 

- specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 

* immunoassays typically involve the measurement of complex formation between ORP and its specific 
10 antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering ORP epitopes is generally used, but a competitive binding assay may also be employed 
(Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess the affinity of antibodies few ORP. Affinity is expressed as an association 
15 constant, 1Q, which is defined as the molar concentration of ORP- antibody complex divided by the 

molar concentrations of free antigen and free antibody under equilibrium conditions. The* determined 
for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple ORP 
epitopes, represents the average affinity, or avidity, of the antibodies for ORP. The determined for a 

- preparation of monoclonal antibodies, which are monospecific for a particular ORP epitope, represents 
20 a true measure of affinity. High-affinity antibody preparations with ranging from about 10 9 to 10 12 

L/mole are preferred for use in immunoassays in which the ORP-antibody complex must withstand 
rigorous manipulations. Low-affinity antibody preparations with ranging from about 10 6 to 10 7 
L/mole are preferred for use in immunopurification and similar procedures which ultimately require 
; dissociation of ORP, preferably in active form, from the antibody (Catty, D. (1988) Antibodies, 
25 Volume I: A Practical Approach . IRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A 
: Practical Guide to Monoclonal Antibodies . John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example, a 
r polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
30 specific antibody/ml, is generally employed in procedures requiring precipitation of ORP-antibody 
- complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available. {See, e.g. t Catty, supra , and 
Coligan et al., supra .) 

In another embodiment of the invention, the polynucleotides encoding ORP, or any fragment or 
35 complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
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expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding ORP. 
Such technology is well known in the art, and antisense oligonucleotides or larger fragments can be 
designed from various locations along the coding or control regions of sequences encoding ORP. (See, 
5 e.g., Agrawal, S., ed. (1996) Antisense Therapeutics * Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 

10 Slater, JJE. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g„ Miller, A.D. (1990) Blood 
76:271; Ausubel, supra : Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 

15 systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 5 l(l):217-225; Boado, R.J. et 
al. (1998) J. Pharm. Sci. 87(1 1):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding ORP may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 

20 (e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X-linked 
inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 

25 Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, LM. and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (eg., 

30 against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; PoescWa, E. e* al. (1996) Proc. Natl. Acad. Sci. USA. 93:1 1395-1 1399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis : and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi) . In the 
case where a genetic deficiency in ORP expression or regulation causes disease, the expression of ORP 

35 from an appropriate population of transduced cells may alleviate the clinical manifestations caused by 
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the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in ORP 
are treated by constructing mammalian expression vectors encoding ORP and introducing these vectors 
by mechanical means into ORP-deficient cells. Mechanical transfer technologies for use with cells in 
5 vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold particle 
— delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the use of 

* DNA transposons (Morgan, R.A and W.F. Anderson (1993) Annu. Rev. Biochem. 62:191-217; Ivies, 

* Z. (1997) Cell 91:501-510; Boulay, J-L. and H. R&ipon (1998) Curr. Opia Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of ORP include, but are not limited 
10 to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
: PTET-ON, FTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). ORP may be expressed 
using (i) a constitutiv&y active promoter, (e.g„ from cytomegalovirus (CMV), Rous sarcoma virus 
(RSV), SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promoter (e.g M the 
15 tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA 

89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) 
Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
FK506/rapamycin inducible promote:; or the RU486/mifepristone inducible promoter (Rossi, F.M. V. 
20 and H.M. Blau, supra) ), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding ORP from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
25 parameters. In the alternative* transformation is performed using the calcium phosphate method 
" (Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by dectroporation (Neumann, E. et al. 
(1982) EMBO J. 1 :841-845). The introduction of DNA to primary ceils requires modification of these 
standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
30 ' respect to ORP expression are treated by constructing a retrovirus vector consisting of ti) the 
A polynucleotide encoding ORP under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus cw-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
35 commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
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Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in an 
appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VS Vg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. ViroL 61:1639-1646; Adam, MA. and 
5 A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880), U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses a 
method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 
Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4 + T-cells), and the 

10 return of transduced cells to a patient are procedures well known to persons skilled in the art of gene 
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et 
al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71 :4707-4716; Ranga, U. et al. 
(1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 

15 polynucleotides encoding ORP to cells which have one or more genetic abnormalities with respect to the 
expression of ORP. The construction and packaging of adenovirus-based vectors are well known to 
those with ordinary skill in the art Replication defective adenovirus vectors have proven to be versatile 
for importing genes encoding immunoregulatory proteins into intact islets in the pancreas (Csete, M.E. 
et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are described in U.S. 

20 Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), hereby incorporated 
by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu. Rev. Nutr. 
19:511-544; and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242, both incorporated by 
reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
25 polynucleotides encoding ORP to target cells which have one or more genetic abnormalities with respect 
to the expression of ORP. The use of herpes simplex virus (HS V)-based vectors may be especially 
valuable for introducing ORP to cells of the central nervous system, for which HSV has a tropism. The 
construction and packaging of hopes-based vectors are well known to those with ordinary skill in the 
art A replication-competent hopes simplex virus (HSV) type 1 -based vector has been used to deliver a 
30 reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res. 169:385-395). The 

construction of a HSV-1 virus vector has also been disclosed in detail in U.S. Patent Number 5,804,413 
to DeLuca ("Herpes simplex virus strains for gene transier"), which is hereby incorporated by 
reference. U.S. Patent Number 5,804,41 3 teaches the use of recombinant HSV d92 which consists of a 
genome containing at least one exogenous gene to be transferred to a cell under the control of the 
35 appropriate promoter for purposes including human gene therapy. Also taught by this patent are the 
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construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. For HSV 
vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 andXu, H. et al. (1994) Dev. Biol. 
163:152-161, hereby incorporated by reference. The manipulation of cloned herpesvirus sequences, the 
generation of recombinant vims following the transfection of multiple plasmids containing different 
5 segments of the large herpesvirus genomes, the growth and propagation of herpesvirus, and the 

- infection of cells with herpesvirus are techniques well known to those of ordinary skill in the art 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding ORP to target cells. The biology of the prototypic alphavirus, Semliki 
• Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on the SFV 
10 genome (Garoff, H. and YL-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During alphavirus RNA 
]*■ replication, a subgenomic RNA is generated that normally encodes the viral capsid proteins. This 

- subgenomic RNA replicates to higher levels thaii the full-length genomic RNA, resulting in the 
overproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g., protease 

- and polymerase). Similarly, inserting the coding sequence for ORP into the alphavirus genome in place 
15 of the capsid-coding region results in the production of a large number of ORP-coding RNAs and the 

synthesis of high levels of ORP in vector transduced cells. While alphavirus infection is typically 
associated with cell lysis within a few days, the ability to establish a persistent infection in hamster 
normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that the lytic replication 
of alphaviruses can be altered to suit the needs of the gene therapy application (Dryga, S. A. et al. 
20 (1 997) Virology 228:74-83). The wide host range of alphaviruses will allow the introduction of ORP 
into a variety of cell types. The specific transduction of a subset of cells in a population may require 
the sorting of cells prior to transduction. The methods of manipulating infectious cDNA clones of 
alphaviruses, pa-forming alphavirus cDNA and RNA transfections, and pa-forming alphavirus 

- infections, are well known to those with ordinary skill in the art. 

25 Oligonucleotides derived from the transcription initiation site* e.g., between about positions -10 

and +10 from the start site, may also be employed to inhibit gene expressioa Similarly, inhibition can 
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
* transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 

30 - been described in the literature. (See, e g., Gee, J.E. et al. (1994) in Huber, BJE. and B.I. Carr, 
Molecular and Immunologic Approaches . Futura Publishing, Mt. Kisco NY, pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 

35 RNA The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
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molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecyles may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding ORP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 

5 scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 

10 oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prq>ared by 
any method known in the art for the synthesis of nucleic acid molecules. These include techniques fear 
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences 

15 encoding ORP. Such DNA sequences may be incorporated into a wide variety of vectors with suitable 
RNA polymerase promoters such asT7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5* and/or 3' ends 

20 of the molecule, or the use of phosphorothioate or 2* O-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in aU of these molecules by the inclusion of nontraditiorial bases such as inosine, queosine, and 
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

25 An additional embodiment of the invention encompasses a method for screening for a 

compound which is effective in altering expression of a polynucleotide encoding ORP. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 

30 chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased ORP 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding ORP may be therapeutically useful, and in the treament of disorders associated with 

35 decreased ORP expression or activity, a compound which specifically promotes expression of the 
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polynucleotide encoding ORP may be therapeutically useful. 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
5 altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
j> based on chemical and/or structural properties of the target polynucleotide; and selection from a 
, library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding ORP is exposed to at least one test compound thus obtained. The sample 
10 may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
vt biochemical system. Alterations in the; expression of a polynucleotide encoding ORP are assayed by 
, any method commonly known in the art Typically, the expression of a specific nucleotide is detected 
„ by hybridization with a probe having a nucleotide sequence complementary to the sequence of the 
polynucleotide encoding ORP. The amount of hybridization may be quantified, thus forming the 
15 basis for a comparison of the expression of the polynucleotide both with and without exposure to one 
or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a 
test compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 
can be carried out, for example, using a Schizosaccharomvces pombe ^ene expression system (Atkins, 
20 D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, CM. et al. (2000) Nucleic Acids Res. 28:E15) or a 
human cell line such as HeLa cell (Clarke, ML. et al. (2000) Biochem. Biophys. Res. Commun. 
268:8-13). A particular embodiment of the present invention involves screening a combinatorial 
library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and 
modified oligonucleotides) for antisense activity against a specific polynucleotide sequence <Bruice, 
25 T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 6,022,1591). 

Many methods for introducing vectors into cells or tissues are available and equally suitable for 
use in vivo, in vitro, and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated for autologous transplant back into that same patient 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
30 using methods which are well known in the art (See, e.g., Goldman, C.K. et al. (1997) Nat 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

35 ^ additional embodiment of the invention relates to the administration of a composition which 
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generally comprises an active ingredient formulated with a pharmaceutical^ acceptable excipient. 
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 
formulations are commonly known and are thoroughly discussed in the latest edition of Remington's 
Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may consist of ORP, 
5 antibodies to ORP, and mimetics, agonists, antagonists, or inhibitors of ORP. 

The compositions utilized in this invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 
10 Compositions for pulmonary administration may be prepared in liquid or dry powder form. 

These compositions are generally aerosolized immediately prior to inhalation by the patient. In the case 
of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast-acting 
formulations is well-known in the art In the case of macromolecules (e.g. larger peptides and proteins), 
recent developments in the field of pulmonary delivery via the alveolar region of the lung have enabled 
15 the practical delivery of drugs such as insulin to blood circulation (see, e.g„ Patton, J.S. et al., U.S. 
Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without needle 
injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination of 
20 an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising ORP or fragments thereof. For example, liposome preparations containing 
a cell-impermeable macromotecule may promote cell fusion and intracellular delivery of the 
macromolecule. Alternatively, ORP or a fragment thereof may be joined to a short cationic N-terminal 
25 portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to transduce into 
the cells of all tissues, including the brain, in a mouse model system (Schwarze* S.R. et al. (1999) 
Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g„ of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
30 or pigs. An animal model may also be used to determine the appropriate concentration range and route 
of administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example ORP or 
fragments thereof, antibodies of ORP, and agonists, antagonists or inhibitors of ORP, which 
35 ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
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standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED^ (the dose therapeutically effective in 50% of the population) or LDso (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the l^D 50 fED y> ratio. Compositions which exhibit large 
5 therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
v used to formulate a range of dosage for human use. The dosage contained in such compositions is 
; preferably within a range of circulating concentrations that includes the EDso with little or no toxicity. 
Hie dosage varies within this range depending upon the dosage form employed, the sensitivity of the 
patient, and the route of administration. 
10 The exact dosage will be determined by the practitioner, in light of factors related to the subject 

requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active 
moiety or to maintain the desired effect Factors which may be taken into account include the severity 
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time 
and frequency of administration, drug combination(s), reaction sensitivities, and response to therapy. 
15 Long-acting compositions may be administered every 3 to 4 days, every week, or biweekly depending 
on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0. 1 g to 1 00,000 g, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
20 Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind ORP may be used for the diagnosis 
25 k of disorders characterized by expression of ORP, or in assays to monitor patients being treated with 
< ORP or agonists, antagonists, or inhibitors of ORP. Antibodies useful for diagnostic purposes may be 
prepared in the same manna- as described above for therapeutics. Diagnostic assays for ORP include 
methods which utilize the antibody and a label to detect ORP in human body fluids or in extracts of 
cells or tissues. The antibodies may be used with or without modification, and may be labeled by 
30- covalent or non-covalent attachment of a reporter molecule. A wide variety of reporter molecules, 
t several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring ORP, including ELISAs, RIAs, and FACS, are known in 
the art and provide a basis for diagnosing altered or abnormal levels of ORP expression. Normal or 
standard values for ORP expression are established by combining body fluids or cell extracts taken 
35 from normal mammalian subjects, for example, human subjects, with antibody to ORP under conditions 
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suitable for complex formation. The amount of standard complex formation may be quantitated by 
various methods, such as photometric means. Quantities of ORP expressed in subject, control, and 
disease samples from biopsied tissues are compared with the standard values. Deviation between 
standard and subject values establishes the parameters for diagnosing disease. 
5 In another embodiment of the invention, the polynucleotides encoding ORP may be used for 

diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
quantify gene expression in biopsied tissues in which expression of ORP may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
10 ORP, and to monitor regulation of ORP levels during therapeutic interventioa 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding ORP or closely related molecules may be used to 
identify nucleic acid sequences which encode ORP. The specificity of the probe, whether it is made 
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g.. a 
15 conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding ORP, allelic variants, or related sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the ORP encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:28-54 or from 
20 genomic sequences including promoters, enhancers, and introns of the ORP gene. 

Means for producing specific hybridization probes for DNAs encoding ORP include the cloning 
of polynucleotide sequences encoding ORP or ORP derivatives into vectors for the production of 
mRNA probes. Such vectors are known in the art, are commercially available, and may be used to 
synthesize RNA probes in vitro by means of the addition of the appropriate RNA polymerases and the 
25 appropriate labeled nucleotides. Hybridization probes may be labeled by a variety of reporter groups, 
for example, by radionuclides such as M P or 35 S, or by enzymatic labels, such as alkaline phosphatase 
coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding ORP may be used for the diagnosis of disorders associated 
with expression of ORP. Examples of such disorders include, but are not limited to, a cell proliferative 
30 disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 
connective tissue disease, myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, 
psoriasis, primary thrombocythemia; a cancer, such as adenocarcinoma, leukemia, lymphoma, 
melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, 
bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, 
35 kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, 
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testis, thymus, thyroid, and uterus; an endocrine disorder, such as disorders of the hypothalamus and 
pituitary resulting from lesions such as primary brain tumors, adenomas, infarction associated with 
pregnancy, hypophysectomy, aneurysms, vascular malformations, thrombosis, infections, 
immunological disorders, and complications due to head trauma, disorders associated with 
hypopituitarism including hypogonadism, Sheehan syndrome, diabetes insipidus, Kallman's disease, 
Hand-Schuller-Christian disease, Letterer-Siwe disease, sarcoidosis, empty sella syndrome, and 
dwarfism, disorders associated with hyperpituitarism including acromegaly, giantism, and syndrome 
of inappropriate antidiuretic hormone (ADH) secretion (SIADH) often caused by benign adenoma, 
disorders associated with hypothyroidism including goiter, myxedema, acute thyroiditis associated 
with bacterial infection, subacute thyroiditis associated with viral infection, autoimmune thyroiditis 
(Hashimoto's disease), and cretinism, disorders associated with hyperthyroidism including 
thyrotoxicosis and its various forms, Grave's disease, pretibial myxedema, toxic multinodular goiter, 
thyroid carcinoma, and Plummer's disease, disorders associated with hyperparathyroidism including 
Conn disease (chronic hypercalemia), pancreatic disorders such as Type I or Type II diabetes mellitus 
and associated complications, disorders associated with the adrenals such as hyperplasia, carcinoma, 
or adenoma of the adrenal cortex, hypertension associated with alkalosis, amyloidosis, hypokalemia, 
Cushing's disease, Liddle's syndrome/ and Arnold-Healy-Gordon syndrome, pheochromocytom a 
tumors, and Addison's disease, disorders associated with gonadal steroid hormones such as: in 
women, abnormal prolactin production, infertility, endometriosis, perturbations of the menstrual 
cycle, polycystic ovarian disease, hyperprolactinemia, isolated gonadotropin deficiency, amenorrhea, 
galactorrhea, hermaphroditism, hirsutism and virilization, breast cancer, and, in post-menopausal 
women, osteoporosis, and, in men, Leydig cell deficiency, male climacteric phase, and germinal cell 
aplasia, hypergonadal disorders associated with Leydig cell tumors, androgen resistance associated 
with absence of androgen receptors, syndrome of 5 a -reductase, and gynecomastia; a metabolic 
disorder, such as Addison's disease, cystic fibrosis, diabetes, fatty hepatocirrhosis, galactosemia, 
goiter, hyperadrenalism, hypoadrenalism, hyperparathyroidism, hypoparathyroidism, 
hypercholesterolemia, hypothyroidism, hypothyroidism hyperlipidemia, hyperlipemia, lipid 
myopathies, obesity, lipodystrophies, and phenylketonuria, congenital adrenal hyperplasia, 
pscudovitamin D-deficiency rickets, cerebrotendinous xanthomatosis, and coumarin resistance; a 
reproductive disorder, such as disorders of prolactin production, infertility, including tubal disease, 
ovulatory defects, and endometriosis, disruptions of the estrous cycle, disruptions of the menstrual 
cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, endometrial and ovarian 
tumors, uterine fibroids, autoimmune disorders, ectopic pregnancies, and teratogenesis; cancer of the 
breast, fibrocystic breast disease, and galactorrhea, disruptions of spermatogenesis, abnormal sperm 
physiology, cancer of the testis, cancer of the prostate, benign prostatic hyperplasia, prostatitis, 
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Peyronie's disease, impotence, carcinoma of the male breast, and gynecomastia; a neurological 
disorder, such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal 
disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural 

5 muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 

10 nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 

15 endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, and Tourette's disorder; an autoimmune/inflammatory disorder, such as acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, 

20 allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune 

hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal 
dystrophy (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, 
dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, 
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's 

25 syndrome, gout, Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel 
syndrome, multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation, 
osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid 
arthritis, scleroderma, SjSgren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, 
systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 

30 complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, 
parasitic, protozoal, and helminthic infections, and trauma; and a viral disorder, such as viral 
infections, e.g., those caused by adenoviruses (acute respiratory disease, pneumonia), arenaviruses 
(lymphocytic choriomeningitis), bunyaviruses (Hantavirus), coronaviruses (pneumonia, chronic 
bronchitis), hepadnaviruses (hepatitis), herpesviruses (herpes simplex virus, varicella-zoster virus, 

35 Epstein-Barr virus, cytomegalovirus), flaviviruses (yellow fever), orthomyxoviruses (influenza), 
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papillomaviruses (cancer), paramyxoviruses (measles, mumps), picornovi ruses (rhinovirus, 
poliovirus, coxsackie-virus), polyoma viruses (BK virus, JC vims), poxviruses (smallpox), reo virus 
(Colorado tick fever), retroviruses (human immunodeficiency virus, human T lymphotropic vims), 
rhabdoviruses (rabies), rotaviruses (gastroenteritis), and togaviruses (encephalitis, rubella). The 
polynucleotide sequences encoding ORP may be used in Southern or northern analysis, dot blot, or 

- other membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like 
assays; and in microarrays utilizing fluids or tissues from patients to detect altered ORP expression. 
Such qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding ORP may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding ORP may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable 

• incubation period, the sample is washed and the signal is quantified and compared with a standard 
value. If the amount of signal in the patient sample is significantly altered in comparison to a control 
sample then the presence of altered levels of nucleotide sequences encoding ORP in the sample indicates 
the presence of the associated disorder. Such assays may also be used to evaluate the efficacy of a 
particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment 
of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of ORP, a 
normal or standard profile for expression is established This may be accomplished by combining body 
fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a 
fragment thereof, encoding ORP, under conditions suitable for hybridization or amplification. Standard 
hybridization may be quantified by comparing the values obtained from normal subjects with values 
from an experiment in which a known amount of a substantially purified polynucleotide is used. 
Standard values obtained in this manner may be compared with values obtained from samples from 
patients who are symptomatic for a disorder. Deviation from standard values is used to establish the 
presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
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clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
" preventative measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding ORP may 

5 involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, or 
produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide encoding ORP, or 
a fragment of a polynucleotide complementary to the polynucleotide encoding ORP, and will be 
employed under optimized conditions for identification of a specific gene or condition. Oligomers may 
also be employed undo* less stringent conditions for detection or quantification of closely related DNA 

10 or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding ORP may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, 
insertions and deletions that are a frequent cause of inherited or acquired genetic disease in humans. 
Methods of SNP detection include, but are not limited to, single-stranded conformation polymorphism 

15 (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from the 
polynucleotide sequences encoding ORP are used to amplify DNA using the polymerase chain reaction 
(PCR). The DNA may be derived, for example, from diseased or normal tissue, biopsy samples, bodily 
fluids, and the like. SNPs in the DNA cause differences in the secondary and tertiary structures of PCR 
products in single-stranded form, and these differences are detectable using gel electrophoresis in non- 
20 denaturing gels. In fSCCP, the oligonucleotide primers are fluorescently labeled, which allows 
detection of the amplimers in high-throughput equipment such as DNA sequencing machines. 
Additionally, sequence database analysis methods, termed in silico SNP (isSNP), are capable of 
identifying polymorphisms by comparing the sequence of individual overlapping DNA fragments which 
assemble into a common consensus sequence. These computer-based methods filter out sequence 

25 variations due to laboratory preparation of DNA and sequencing errors using statistical models and 
automated analyses of DNA sequence chromatograms. In the alternative, SNPs may be detected and 
characterized by mass spectrometry using, for example, the high throughput MASSARRAY system 
(Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of ORP include radiolabeling or 

30 biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 

standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 
al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometry or colorimetric response gives rapid 

35 quantitation. 
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In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as dements on a microarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 
numbers of genes simultaneously as described in Seilhamer, J J. et al., "Comparative Gene Transcript 
5 Analysis," U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be 
* used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
10 to develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective 
treatment regimen for that patient For example, therapeutic agents which are highly effective and 
display the fewest side effects may be selected for a patient based on his/her pharmacogenomic profile. 

In another embodiment, antibodies specific for ORP, or ORP or fragments thereof may be used 
as elements on a microarray. The microarray may be used to monitor or measure protein-protein 
15 interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at a 
20 given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent Number 

5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
25 invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene expression in vivo , as in the 
case of a tissue or biopsy sample, or in vitro , as in the case of a cell line. 
30 Transcript images which profile the expression of the polynucleotides of the present invention 

may also be used in conjunction with in vitro model systems and preclinical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity 
35 (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Andereon (2000) 
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Toxicol. Lett. 1 12-1 1 3:467-471, expressly incorporated by reference herein). If a test compound has a 
signature similar to that of a compound with known toxicity, it is likely to share those toxic properties. 
These fingerprints or signatures are most useful and refined when they contain expression information 
from a large number of genes and gene families. Ideally, a genome-wide measurement of expression 

5 provides the highest quality signature. Even genes whose expression is not altered by any tested 
compounds are important as well, as the levels of expression of these genes are used to normalize the 
rest of the expression data. The normalization procedure is useful for comparison of expression data 
after treatment with different compounds. While the assignment of gene function to elements of a 
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not 

10 necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for 
example, Press Release 00-02 from the National Institute of Environmental Health Sciences, released 
February 29, 2000, available at httpy/ww.mehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is 
important and desirable in toxicological screening using toxicant signatures to include all expressed 
gene sequences. 

15 In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 

containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared with 
20 levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of the polypeptide sequences of the present 
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global 
pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome 
25 can be subjected individually to further analysis. Proteome expression patterns, or profiles, are 
analyzed by quantifying the number of expressed proteins and their relative abundance under- given 
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and 
analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is 
achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by 
30 isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra). The proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an agent 
such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is 
generally proportional to the level of the protein in the sample. The optical densities of equivalently 
35 positioned protein spots from different samples, for example, from biological samples either treated or 
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untreated with a test compound or therapeutic agent, are compared to identify any changes in proton 
spot density related to the treatment The proteins in the spots are partially sequenced using, for 
example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of 
5 at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In 
some cases, further sequence data may be obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for ORP to quantify the 
- levels of ORP expression. In one embodiment, the antibodies are used as elements on a microarray, and 
protein expression levels are quantified by exposing the microarray to the sample and detecting the 
10 levels of protein bound to each array element (Liieking, A. et al. (1999) Anal. Biochem 270:103-1 1 1 ; 
Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of 
5* methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino- 
reactive fluorescent compound and detecting the amount of fluorescence bound at each array element 

Toxicant signatures at the proteome level are also useful for toxicological screening, and should 
15 be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation 
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and J. 
Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 
analysis of compounds which do not significantly affect the transcript image, but which alter the 
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
20 degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated biological 
sample are separated so that the amount of each protein can be quantified. The amount of each protein 
is compared to the amount of the corresponding protein in an untreated biological sample. A difference 
25 in the amount of protein between the two samples is indicative of a toxic response to the test compound 
* z in the treated sample. Individual proteins are identified by sequencing the amino acid residues of the 
individual proteins and comparing these partial sequences to the polypeptides of the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins from the biological sample are incubated 
30 ~ with antibodies specific to the polypeptides of the present invention. The amount of protein recognized 
^ by the antibodies is quantified. The amount of protein in the treated biological sample is compared with 
the amount in an untreated biological sample. A difference in the amount of protein between the two 
samples is indicative of a toxic response to the test compound in the treated sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art. {See, e.g., 
35 Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
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USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well 
known and thoroughly described in DNA Microarrays: A Practical Approach , M. Schena, ed. (1999) 
5 Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding ORP may be used to 
generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 

10 of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, JJ. et al. (1997) Nat. 

15 Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 
7:149-154,) Once mapped, the nucleic acid sequences of the invention may be used to develop genetic 
linkage maps, for example, which correlate the inheritance of a disease state with the inheritance of a 
particular chromosome region or restriction fragment length polymorphism (RFLP). (See, e.g., 
Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

20 Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map 

data. (See, e.g„ Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic map 
data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) 
World Wide Web site. Correlation between the location of the gene encoding ORP on a physical map 
and a specific disorder, or a predisposition to a specific disorder, may help define the region of DNA 

25 associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 
reveal associated markers even if the exact chromosomal locus is not known. This information is 

30 valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely localized 
by genetic linkage to a particular genomic region, e.g„ ataxia-telangiectasia to 1 lq22-23, any sequences 
mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g., 
Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may 

35 also be used to detect differences in the chromosomal location due to translocation, inversion, etc., 

57 

BNSDOCID: <WO 01 44448A2_L> 



WO 01/44448 PCT/US00/33158 

among normal, carrier, or. affected individuals. 

In another embodiment of the invention, ORP, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
5 solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
^ between ORP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
~ having suitable binding affinity to the protein of interest (See, e.g., Geysen, et al. (1 984) PCT 
* application WO84/03564.) In this method, large numbers of different small test compounds are 
10 synthesized on a solid substrate. The test compounds are reacted with ORP, or fragments thereof; and 
washed. Bound ORP is then detected by methods well known in the art. Purified ORP can also be 
coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively; 
4X ' non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
15 antibodies capable of binding ORP specifically compete with a test compound for binding ORP. In this 
manner, antibodies can be used to detect the presence of any peptide which shares one or more antigenic 
determinants with ORP. 

In additional embodiments, the nucleotide sequences which encode ORP may be used in any 
molecular biology techniques that have yet to be developed, provided the new techniques rely on 
20 properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
25 of the disclosure in any way whatsoever. 

" - The disclosures of all patents, applications, and publications mentioned above and below, in 

5 particular U.S. Ser. No. 60/172,367, are hereby expressly incorporated by reference. 

EXAMPLES 

30 I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some tissues 
were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed 
in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic 
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl 
35 cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol 
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or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 

5 Chatsworth C A), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g„ the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries ware constructed with the UNIZ AP 

10 vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 

recommended procedures or similar methods known in the art. (See, e>g„ Ausubel, 1997, supra, units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 

15 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 

chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g„ 
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDN A2. 1 plasmid 
(Invitrogen, Carlsbad C A), or pINC Y plasmid (Incyte Genomics, Palo Alto C A). Recombinant 

20 plasmids were transformed into competent E. coli cells including XL1 -Blue, XL 1 -BlueMRF, or SOLR 
from Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. 
II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo excision 
using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using at least 

25 one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC 
Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid 
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0. 1 ml of 
distilled water and stored, with or without lyophilization, at 4 C. 

30 Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 

high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 
well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 

35 (Labsystems Oy, Helsinki, Finland). 
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III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler 
5 (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 

^MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as 
- the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labded polynucleotides were 
10 carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI 
PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art Reading 
-frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, 
su^ra, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques 
15 disclosed in Example VI. 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions, 
references, and threshold parameters. The first column of Table 5 shows the tools, programs, and 
20 algorithms used, the second column provides brief descriptions thereof, the third column presents 

appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the score, the greater the homology between 
two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 
25 Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and 
polypeptide sequence alignments were generated using the default parameters specified by the clustal 
algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which 
also calculates the percent identity between aligned sequences. 

The polynucleotide sequences were validated by removing vector, linker, and poly A sequences 
30 and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic 

programing, and dinucleotide nearest neighbor analysis. The sequences were then queried against a 
selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and 
eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation 
using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into full 
35 length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened 
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for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 
polynucleotide sequences were translated to derive the corresponding full length amino acid sequences, 
and these full length sequences were subsequently analyzed by querying against databases such as the 
GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, 
5 and Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a 
probabilistic approach which analyzes consensus primary structures of gene families. (See,e.g., 
Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide and 
amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ IP 
10 NO:28-54. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. 
IV. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 
and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a 
15 particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 1995, 
supra , ch. 4 and. 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
20 search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 

5 x minimum {length(Seq. 1), length(Seq. 2)} 

25 The product score takes into account both the degree of similarity between two sequences and the length 
of the sequence match. The product score is a normalized value between 0 and 100, and is calculated 
as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided 
by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by 
assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for 

30 every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more 
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The 
product score represents a balance between fractional overlap and quality in a BLAST alignment. For 
examples product score of 100 is produced only for 100% identity over the entire length of the shorter 
of the two sequences being compared. A product score of 70 is produced either by 1 00% identity and 

35 70% overlap at one end, or by 88% identity and 1 00% overlap at the other. A product score of 50 is 
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produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% ova-lap. 

The results of northern analyses are reported as a percentage distribution of libraries in which 
the transcript encoding ORP occurred. Analysis involved the categorization of cDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
5 developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
- reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
•* Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 
10 3. 

^ V. Chromosomal Mapping of ORP Encoding Polynucleotides 

The cDNA sequences which were used to assemble SEQ ID NO:28-54 were compared with 
* sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith- Waterman algorithm. Sequences from these databases that matched 
15 SEQ ID NO:28-54 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Cento- (SHGC), Whitehead Institute for 
Genome Research (WIGR), and G6n6thon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
20 of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

* The genetic map locations of SEQ ID NO:31 , SEQ ID NO:42, SEQ ID NO:48, and SEQ ID 

NO:53 are described in The Invention as ranges, or intervals, of human chromosomes. More than one 
. map location is reported for SEQ ID NO:48, indicating that previously mapped sequences having 
similarity, but not complete identity, to SEQ ID NO:48 were assembled into their respective clusters. 
25, : The map position of an interval, in centiMorgans, is measured relative to the terminus of the 
chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination 
frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase 
(Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.) 
The cM distances are based on genetic markers mapped by GdnSthon which provide boundaries for 
3CL radiation hybrid markers whose sequences were included in each of the clusters. Human genome 
, }t maps and other resources available to the public, such as the NCBI M GeneMap'99 ,f World Wide Web 
site (httpy/www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 
VI. Extension of ORP Encoding Polynucleotides 
35 The full length nucleic acid sequences of SEQ ID NO:28-54 were produced by extension of an 
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appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
fragment One primer was synthesized to initiate 5* extension of the known fragment, and the other 
primer, to initiate 3* extension of the known fragment. The initial primers ware designed using OLIGO 
4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides 

5 in length, to have a GC content of about 50% or more, and to anneal to the target sequence at 

temperatures of about 68 C to about 72 C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one extension 
was necessary or desired, additional or nested sets of primers were designed. 

10 High fidelity amplification was obtained by PGR using methods well known in the art. PGR 

was performed in 96- well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DN A template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , 
and p-mercaptoethanol, Taq DN A polymerase (Amersham Pharmacia Biotech), ELONG ASE enzyme 
(Life Technologies), and Pfii DN A polymerase (Stratagene), with the following parameters for primer 

15 pair PCI A and PCI B: Step 1: 94 C, 3 min; Step 2: 94 C, 15 sec; Step 3: 60 C, 1 min; Step 4: 68 C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 C, 5 min; Step 7: storage at 4 C. In the 
altonative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94 C, 3 min; Step 2: 
94 C, 15 sec; Step 3: 57 C, 1 min; Step 4: 68 C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 C, 5 min; Step 7: storage at 4 C. 

20 The concentration of DNA in each well was determined by dispensing 100 fal PICOGREEN 

quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 nl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 

25 concentration of DNA. A 5 ItolO 1 aliquot of the reaction mixture was analyzed by electrophoresis 
on a 1 % agarose mini-gel to determine which reactions were successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 

30 shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose 
gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
and transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing 

35 media, and individual colonies were picked and cultured overnight at 37 C in 384-well plates in LB/2x 
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carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham 
Pharmacia Biotech) and Pfii DNA polymerase (Stratagene) with the following parameters: Step 1 : 
94 C, 3 min; Step 2: 94 C f 15 sec; Step 3: 60 C, 1 min; Step 4: 72 C, 2 min; Step 5: steps 2, 3, and 4 
5 repeated 29 times; Step 6: 72 C, 5 min; Step 7: storage at 4 C. DNA was quantified by PICOGREEN 
"~ reagent (Molecular Probes) as described above. Samples with low DNA recoveries were reamplified 
using the same conditions as described above. Samples were diluted with 20% dimethysulfoxide (1 :2, 
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 
- DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 
10 sequencing ready reaction kit (Applied Biosystems). 
r " In like manner, the polynucleotide sequences of SEQ ID NO:28-54 are used to obtain 5 9 

regulatory sequences using the procedure above, along with oligonucleotides designed for such 
extension, and an appropriate genomic library. 
VII. Labeling and Use of Individual Hybridization Probes 
15 Hybridization probes derived from SEQ ID NO:28-54 are employed to screen cDNAs, genomic 

DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is 
specifically described, essentially the same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National 
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 Ci of [y- 32 P] adenosine 
20 triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston 
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size 
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10 7 counts per 
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 
genomic DNA digested with one of the following endonueleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or 
25 Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & SchueU, Durham NH). Hybridization is carried out for 16 
hours at 40 C To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
30 ' Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

VIII. Microarrays 

The linkage or synthesis of array dements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschwealer, supra) , mechanical 
35 microspotting technologies, and derivatives thereof The substrate in each of the aforementioned 
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technologies should be uniform and solid with a non-porous surface (Schena (1999), supra). Suggested 
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be 
5 produced using available methods and machines well known to those of ordinary skill in the art and may 
contain any appropriate number of elements. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; 
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 
16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
10 comprise the elements of the microarray. Fragments ot obgoinere ^ 

selected using software well known in the art such as LASERGENE software (DNASTAR). The array 
elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
15 fluorescence scanner is used to detect hybridization at each array element Alteniatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
the microarray may be assessed. In one embodiment, microarray preparation and usage is described in 
detail below. 
20 Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/ul oligo-(dT) primer (21mer), IX first 
strand buffer, 0.03 units/ul RNase inhibitor, 500 uM dATP, 500 uM dGTP, 500 uM dTTP, 40 pM 
25 dCTP, 40 uM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 
GEMB RIGHT kits (Incyte). Specific control poly(A)* RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
30 incubated for 20 minutes at 85 C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
35 resuspended in 14 ul 5X SSC/0.2% SDS. 
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Micro array Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 
amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert Array elements are 
5 amplified in thirty cycles of PCR from an initial quantity of 1 -2 ng to a final quantity greater than 5 
^ Mg. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
~* Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
^ slides (Corning) are cleaned by ultrasound in 0. 1 % SDS and acetone, with extensive distilled water 
10 6 washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 

Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
coated with 0.05% aminopropyl silane (Sigma) in 95% ethanoi. Coated slides are cured in a 1 10°C 
- oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
15 Patent No. 5,807,522, incorporated herein by reference. 1 pi of the array element DNA, at an average 

concentration of 100 ng/|jl, is loaded into the open capillary printing element by a high-speed robotic 

apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
20 Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 

buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 C followed by washes in 

0.2% SDS and distilled water as before. 

Hybridization 

Hybridization reactions contain 9 pi of sample mixture consisting of 0.2 \xg each of Cy3 and 
25 Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 

mixture is heated to 65 C for 5 minutes and is aliquoted onto the microarray surface and covered with 
an 1.8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 
140 nl of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for 
30 about 6.5 hours at 60 C. The arrays are washed for 10 min at 45 C in a first wash buffo- (IX SSC, 
0.1% SDS), three times for 10 minutes each at 45 C in a second wash buffer (0.1X SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
35 at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
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focused on the airay using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1 .8 cm x 1 .8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 
5 In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 

Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
10 typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
15 location to be correlated with a weight ratio of hybridizing species of 1 :100,000. When two samples 
from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially 
expressed, the calibration is done by labeling samples of the calibrating cDNA with the two 
fluorophores and adding identical amounts of each to the hybridization mixture. 
20 The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 

(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
25 measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore' s emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 
30 for signal analysis is the GEMTOOLS gene expression analysis program Gncyte). 
IX. Complementary Polynucleotides 

Sequences complementary to the ORP-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring ORP. Although use of oligonucleotides 
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with 
35 smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
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4.06 software (National Biosciences) and the coding sequence of ORP. To inhibit transcription, a 
complementary oligonucleotide is designed from the most unique 5* sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the ORP-encoding transcript 
5 X. Expression of ORP 

Expression and purification of ORP is achieved using bacterial or virus-based expression 
y systems, For expression of ORP in bacteria, cDNA is subcloned into an appropriate vector containing 
an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA transcription. 
Examples of such promoters include, but are not limited to, the irp-lac (tac) hybrid promoter and the 
10 T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory element. 

Recombinant vectors are transformed into suitable bacterial hosts, e.g„ BL21(DE3). Antibiotic 
resistant bacteria express ORP upon induction with isopropyl beta-D-thiogalactopyranoside <IPTG). 
Expression of ORP in eukaryotic cells is achieved by infecting insect or mammalian cell lines with 
recombinant Autographica californica nuclear polyhedrosis virus (AcMNPV), commonly known as 

15 baculovirus. The nonessential polyhedrin gene of baculovirus is replaced with cDNA encoding ORP by 
either homologous recombination or bacterial-mediated transposition involving transfer plasmid 
intermediates. Viral infectivity is maintained and the strong polyhedrin promoter drives high levels of 
cDNA transcription. Recombinant baculovirus is used to infect Spodoptera frugiperda (Sf9) insect 
cells in most cases, or human hepatocytes, in some cases. Infection of the latter requires additional 

20 genetic modifications to baculovirus. (See Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA 
91 :3224-3227; Sandig, V. et al. (1996) Hum. Gene Then 7:1937-1945.) 

In most expression systems, ORP is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 

25 enzyme from Schistosoma iaponicum . enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from ORP at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 

30 His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 

(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra , 
ch. 10 and 16). Purified ORP obtained by these methods can be used directly in the assays shown in 
Examples XI and XV. 
XI. Demonstrati n of ORP Activity 

35 For purposes of example, an assay demonstrating the activity of a short-chain alcohol 
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dehydrogenase is described. Essentially the same method is used for other types of oxidoreductases, 
with suitable substitution of the substrate and electron acceptor. ORP activity is demonstrated by the 
oxidation of NADPH to NADP in the presence of substrate (Kunau and Dommes (1978) Eur. J. 
Biochem. 91 :533-544). Substrates include, but are not limited to, all-trans-retinaldehyde and cis-4- 

5 dienoyl-CoA. ORP is preincubated for 10 minutes at 37 C in 60 |iM potassium phosphate (pH 7.4), 
125 nM NADPH, and 0.2 pM CoA (coenzyme A). The reaction is initiated by addition of the 
appropriate substrate (12.5 to 150 \M final concentration). The change in absorbance of the reaction 
at 340 ran, due to the oxidation of NADPH to NADP, is measured using a spectrophotometer at 23 
°C. Units of ORP activity are expressed as pinoles of NADP formed per minute. A reaction lacking 

10 ORP is used as a negative control. 

Alternatively, ORP activity is assayed by measuring the reduction of insulin. Aliquots of 
ORP are preincubated at 37 °C for 20 min with 2 pi of 50 mM Hepes, pH 7.6, 100 pg/ml bovine 
serum albumin, and 2 mMDTT in a total volume of 70 pi. Then, 40 pi of a reaction mixture 
composed of 200 pi of Hepes (1 M), pH 7.6, 40 pi of EDTA (0.2 M), 40 pi of NADPH (40 mg/ml), 

15 and 500 pi of insulin (10 mg/ml) is added. The reaction is initiated with the addition of 10 pi of 

thioredoxin reductase from calf thymus (3.0 A412 unit), and incubation is continued for 20 min at 37 
°C. The reaction rate is followed by monitoring the oxidation of NADPH at 412 nM. The oxidation 
of NADPH is proportional to the amount of insulin reduction. 
XII. Functional Assays 

20 ORP function is assessed by expressing the sequences encoding ORP at physiologically 

elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDN A expression. Vectors of choice 
include pCMV SPORT plasmid (Life Technologies) and pCR3.1 plasmid (Invitrogen), both of which 
contain the cytomegalovirus promoter. 5-10 g of recombinant vector are transiently transfected into a 

25 human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 

formulations or electroporation. 1-2 g of an additional plasmid containing sequences encoding a 
marker protein are co- transfected. Expression of a marker protein provides a means to distinguish 
transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; 

30 Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics- 
based technique, is used to identify traasfected cells expressing GFP or CD64-GFP and to evaluate the 
apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 
fluorescent molecules that diagnose events preceding or coincident with cell death. These events include 
changes in nuclear DNA content as measured by staining of DNA with propidium iodide; changes in 

35 cell size and granularity as measured by forward light scatter and 90 degree side light scatter; down- 
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regulation of DN A synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in 
expression of cell surface and intracellular proteins as measured by reactivity with specific antibodies; 
and alterations in plasma membrane composition as measured by the binding of fluorescein-conjugated 
Annexin V protein to the cell surface. Methods in flow cytometry are discussed in Ormerod, M.G. 
5 (1994) Flow Cytometry , Oxford, New York NY. 

The influence of ORP on gene expression can be assessed using highly purified populations of 
- cells transfected with sequences encoding ORP and either CD64 or CD64-GFP. CD64 and CD64-GFP 
J - are expressed on the surface of transfected cells and bind to conserved regions of human 
* immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
10 magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY). 
mRNA can be purified from the cells using methods well known by those of skill in the art. Expression 
of mRNA encoding ORP and other genes of interest can be analyzed by northern analysis or microarray 
r techniques. 

XIIL Production of ORP Specific Antibodies 
15 ORP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 

Harrington, M,G. (1990) Methods EnzymoL 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the ORP amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
20 synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 

selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art (See, e.g., Ausubel, 1995, supra , ch. 11.) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- Aldrich, 
25" St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 
* immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the oligopeptide-KLH 
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-ORP 
activity by, for example, binding the peptide or ORP to a substrate, blocking with 1 % BS A, reacting 
with rabbit antisera, washing, and reacting with radioiodinated goat anti-rabbit IgG. 
30 XIV. Purification of Naturally Occurring ORP Using Specific Antibodies 

Naturally occurring or recombinant ORP is substantially purified by immunoaffinity 
chromatography using antibodies specific for ORP. An immunoaffinity column is constructed by 
covalently coupling anti-ORP antibody to an activated chromatographic resin, such as CNBr-activated 
SEPH AROSE (Amersham Pharmacia Biotech). After the coupling, the resin is blocked and washed 
35 according to the manufacturer's instructions. 
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Media containing ORP are passed over the inimunoaffinity column, and the column is washed 
under conditions that allow the preferential absorbance of ORP (e.g., high ionic strength buffers in the 
presence of detergent). The column is eluted under conditions that disrupt antibody/ORP binding (e.g. t 
a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion), and 
5 ORP is collected. 

XV. Identification of Molecules Which Interact with ORP 

ORP, or biologically active fragments thereof, are labeled with Bolton-Hunter reagent. 
(See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 1 33:529-539.) Candidate molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled ORP, washed, and 
10 any wells with labeled ORP complex are assayed. Data obtained using different concentrations of ORP 
are used to calculate values for the number, affinity, and association of ORP with the candidate 
molecules. 

Alternatively, molecules interacting with ORP are analyzed using the yeast two-hybrid 
system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
15 available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

ORP may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
No. 6,057,101). 

20 

Various modifications and variations of the described methods and systems of the invention will 
be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
25 Indeed, various modifications of the described modes for carrying out the invention which are obvious 
to those skilled in molecular biology or related fields are intended to be within the scope of the following 
claims. 
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1 . An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1 -27, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, and 

10 d) an immunogenic fragment of an amino acid sequence selected from the group consisting 

of SEQ ID NO: 1-27. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l- 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

20 5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 

NO:28-54. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

25 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

30 9. A method for producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 
1, and 

35 b) recovering the polypeptide so expressed. 
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10. An isolated antibody which specifically binds to a polypeptide of claim 1. 

1 1. An isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of: 

5 a) a polynucleotide sequence selected from the group consisting of SEQ ID NO: 28-54, 

b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 
10 e) an RN A equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 1 1 * ■ 

13. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 11, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof . 

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1 , the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

16. A composition comprising an effective amount of a polypeptide of claim 1 and a 
pharmaceutical^ acceptable excipient 

35 
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17. A composition of claim 16, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-27. 

18. A method for treating a disease or condition associated with decreased expression of 
functional ORP, comprising administering to a patient in need of such treatment the composition of 
claim 16. 

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20. A composition comprising an agonist compound identified by a method of claim 19 and 
a pharmaceutically acceptable excipient 

21 . A method for treating a disease or condition associated with decreased expression of 
functional ORP, comprising administering to a patient in need of such treatment a composition of 
claim 20. 

22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 
of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

23. A composition comprising an antagonist compound identified by a method of claim 22 
and a pharmaceutically acceptable excipient 

24. A method for treating a disease or condition associated with overexpression of functional 
ORP, comprising administering to a patient in need of such treatment a composition of claim 23. 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1 , said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
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compound that specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1 , said method comprising: 

5 a) combining the polypeptide of claim 1 with at least one test compound under conditions 

permissive for the activity of the polypeptide of claim 1 , 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 
10 with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change 

in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1 . 

27. A method for screening a compound for effectiveness in altering expression of a target 
15 polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 

comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under conditions 
suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

20 c) comparing the expression of the target polynucleotide in the presence of varying amounts of 

the compound and in the absence of the compound. 

28. A method for assessing toxicity of a test compound, said method comprising: 
a) treating a biological sample containing nucleic acids with the test compound; 

25 b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 

least 20 contiguous nucleotides of a polynucleotide of claim 1 1 under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 
1 1 or fragment thereof; 

30 c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test 
compound. 
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SEQUENCE LISTING 



<110> INCYTE GENOMICS, INC. 
YUE, Henry 
LAL, Preeti 
TANG, Y. Tom 
HILLMAN, Jennifer 
BAUGHN, Mariah R . 
AZIMZAI, Yalda 
LU, Dyung Aina M. 

<120> HUMAN OXIDOREDUCTASE PROTEINS 

<130> PF-0754 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/172,367 
<151> 1999-12-16 

<160> 54 

<170> PERL Program 

<210> 1 

<211> 468 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 543496CD1 

<400> 1 

Met Ala Ala Arg Leu Val Ser Arg Cys Gly Ala Val Arg Ala Ala 
15 10 15 

Pro His Ser Gly Pro Leu Val Ser Trp Arg Arg Trp Ser Gly Ala 
20 25 30 

Ser Thr Asp Thr Val Tyr Asp Val Val Val Ser Gly Gly Gly Leu 
35 40 45 

Val Gly Ala Ala Met Ala Cys Ala Leu Gly Tyr Asp lie His Phe 
50 55 60 

His Asp Lys Lys lie Leu Leu Leu Glu Ala Gly Pro Lys Lys Val 
-65 70 75 

Leu Glu Lys Leu Ser Glu Thr Tyr Ser Asn Arg Val Ser Ser lie 
.80 85 90 

Ser Pro Gly Ser Ala Thr Leu Leu Ser Ser Phe -Gly Ala Trp Asp 
-95 100 105 

His lie Cys Asn Met Arg Tyr Arg Ala Phe Arg Arg Met Gin Val 
110 115 120 

Trp Asp Ala Cys Ser Glu Ala Leu lie Met Phe Asp Lys Asp Asn 
125 130 135 

Leu Asp Asp Met Gly Tyr lie Val Glu Asn Asp Val lie Met His 
140 145 150 

Ala Leu Thr Lys Gin Leu Glu Ala Val Ser Asp Arg Val 'Thr Val 
155 160 165 

Leu Tyr Arg Ser Lys Ala lie Arg Tyr Thr Trp Pro Cys Pro Phe 
170 175 180 

Pro Met Ala Asp Ser Ser Pro Trp Val His. lie Thr Leu Gly Asp 
185 190 195 

Gly Ser Thr Phe Gin Thr Lys Leu Leu lie Gly Ala Asp Gly His 
200 205 210 

Asn Ser Gly Val Arg Gin Ala Val Gly lie Gin Asn Val Ser Trp 
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Ser 


Arg 


Asn 


Leu 


Val 










170 








lie 


Asn 


Pro 


Leu 


Leu 


Ser 


He 


Leu 










185 








Arg 


Glu 


Gin 


Ala 


Asn 


Lys 


Arg Asn 










200 








Lys 


Leu 


Phe 


Tyr 


Ser 


Ala 


Lys 


Asn 










215 








Lys 


Asn 


He 


Leu 


Asp 


Leu 


Val 


Asn 










230 








Cys 


Ser 


Leu 


His 


Val 


Thr 


Lys 


Gin 










245 






Leu 


Lys 


Pro 


Tyr 


He 


Thr 


Glu 


Gly 










260 








Arg 


Asp 


His 


He 


Ser 


Lys 


Glu 


Thr 










275 








Pro 


Pro 


Met 


Thr 


Asp 


Phe 


Phe 


Ser 










290 








Val 


Pro 


Lys 


Glu 


His 


He 


Cys 


Phe 










305 







<210> 8 

<211> 160 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1865603CD1 

<400> 8 

Met Leu Arg His Leu Pro Ser Arg 

1 5 
Arg Thr Leu Glu Lys Gin Ser Trp 
20 

Pro Pro Cys Leu He Arg Arg Leu 

35. 

Lys Ser He Lys Asp Thr Thr Met 
50 

Met Glu Val Met Thr Phe Lys Glu 
65 



Pro 


Glv 


Leu 


Leu Arg 


Cvs 


Ser 




10 








15 


Ala 


Ser 


Leu 


Arg Leu 


Thr 


Leu 




25 








30 


Thr 


Ser 


He 


Met Lys 


Ser 


Lys 




40 








45 


Thr 


Ala 


Ser 


Val Leu 


Arg 


Arg 




55 








60 


Cys 


Glv 


Ala 


Ala Ser 


Glu 


Ser 




70 








75 


Leu 


Val 


Ala 


Asp Gin 


Asp 


Phe 




85 








9 0 


Asp 


Phe 


Phe 


He Pro 


Glv 
\* xy 


Val 




100 








105 


Cys 


Ser 


Ser 


Pro Arg 


Leu 


Leu 




115 








120 


Ala 


Val 


Lys 


Tyr Thr 


Asn 






130 








135 


Thr 


Cys 


Thr 


Leu Asp 


Cys 






145 








150 


Phe 


Phe 


Phe 


Asp Pro 


Gin 


XT i. V/ 




160 








X O ZJ 


Leu 


He 


Ala Gly Gly Val 


at vr 




175 








-LOU 


AL vj 


nxo 


Ala Ala Asp 


Leu 


Leu 




1 V w 










C2~l vr 

Vj .L 


xyr 


Glu 


He Gly Thr 


x xe 




205 










XXIX 


C 1 >- 


Glu 


Leu Leu 


Phe 


Lys 




220 








Zj 


Cll 1 1 




Pro 


Glu Lys 


He 


nla 




J Zj 










XII JL 


J. LIJL 


Gin 


He Asn 


Ala 






250 












Tip 
lie 


Thr 


Glu Lys 


GlU 


lie 




265 








270 


Leu 


Phe 


Tyr 


He Cys 


Gly 


Pro 




280 








285 


Lys 


Gin 


Leu 


Glu Asn 


Asn 


His 




295 








300 


Glu 


Lys 


Trp 


Trp 








310 











Leu 


Pro 


Val 


Lys 


Met Trp 


Gly 




10 








15 


Arg 


Asp 


Ser 


Ser 


Gin Thr 


Pro 




25 








30 


Asp 


His 


He 


Val 


Met Thr 


Val 




40 








45 


Phe 


Tyr 


Ser 


Lys 


He Leu 


Gly 




55 








60 


Asp 


Arg 


Lys 


Ala 


Leu Cys 


Phe 




70 








75 
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Gly 


Asp 




Lys 


rTie 

O A 


Asn 


Leu 


His 


Glu 


Pro 


Lys 


Ala 


nia 


rtxs 


Pro 


v CLX 


Pro 


Gly 








Q 










He 


Thr 


GlU 


Val 


Pro 

J. xu 


Leu 


GlU 


Glu 


Met 


Cys 


Asp 


Val 


Pro 


He 


Glu 


GlU 


Gly Pro 






125 










Lys 


Gly 


Pro 


He 


Met 


Ser 


He 


Tyr 


Phe 






140 










Leu 


He 


Glu 


Val 


Ser 
155 


Asn 


Tyr 


He 


Ser 



PCTYUS00/33158 



V CLX 




Lys 


Glu 


Phe 


Glu 


85 






90 


Ser 


Leu 


Asp 


He 


Cvs 


Leu 


100 










105 


lie 


Gin 


His 


Leu 


Lvs 


Ala 


115 










120 


Val 


Pro 


Arg 


Thr 


Gly 


Ala 


130 










135 


Arg Asp 


Pro 


Asp 


Arg 


Asn 


145 










150 


Ser 












160 













<210> 9 

<211> 487 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1976472CD1 

<400> 9 

Met Asp Trp He Arg Leu Thr Lys 

1 5 
Leu Lys Gly Arg Leu He Glu Val 
20 

His Asn Lys Lys Asp Asp Cys Trp 
35 

Tyr Asn Val Ser Pro Tyr Met Glu 
50 

Glu Leu Met Arg Ala Ala Gly Ser 
65 

Gin Val His Arg Trp Val Asn Tyr 
80 

Leu Val Gly Arg Met Ala He Lys 
95 

Arg Glu Glu Glu Lys Lys Val Leu 
110 

Gin Val Thr Asp Thr Leu Ala Lys 
125 

Tyr Asp Trp Phe Gin Thr Asp Ser 
140 

Thr Lys Gin Lys Asp He Asn Leu 
155 

Gin Asn Asp Ser Phe Arg Ala Glu 
170 

Tyr Leu He His He Gly Leu Ser 
185 

Ser Val Arg Val Val Glu Ser Val 
200 

Gin Lys Lys Glu Asn Thr Ser Trp 
215 

Lys Asn His Asn Ser Leu He Pro 
230 

Tyr Arg Lys Cys Gin Leu He Ser 
245 

Thr Arg Leu Phe Cys Leu Met Leu 
260 

Val Pro He Gly Gin His Val Tyr 
275 

Thr Glu He Val Lys Pro Tyr Thr 
290 

Ser Glu Phe Lys Glu Pro Val Leu 



Ser 


Gly Lys 


Asp 


Leu 


Thr 


Gly 




10 










15 


Thr 


Glu 


Glu 


Glu 


Leu 


Lys 


Lys 




25 










30 


lie 


Cys 


He Arg Gly Phe 


Val 




40 










45 


Tyr 


His 


Pro 


Gly Gly Glu 


Asp 


55 










60 


Ion 
Asp 


Gly 


Thr 


Glu 


Leu 


Phe 


Asp 


70 










75 


GlU 


Ser 


Met 


Leu 


Lys 


Glu 


Cys 




85 










90 


Pro 


Ala 


Val 


Leu 


Lys 


Asp 


Tyr 




100 










105 


Asn 


Gly Met 


Leu 


Pro 


Lys 


ser 




115 










120 


Glu 


Gly 


Pro 


Ser 


Tyr 


Pro 


Ser 




130 










135 


Leu 


Val 


Thr 


He 


Ala 


He 


Tyr 




145 










150 


Asp 


Ser 


lie 


He 


Val 


Asp 


His 


160 










165 


Thr 


He 


He 


Lys 


Asp 


Cys 


Leu 




175 










180 


His 


Glu 


Val 


Gin 


Glu 


Asp 


Phe 




190 










195 


Gly 


Lys 


He 


Glu 


He 


Val 


Leu 


205 










210 


Asp 


Phe 


Leu Gly His 


Pro 


Leu 


220 










225 


Arg 


Lys 


Asp 


Thr 


Gly Leu Tyr 




235 










240 


Lys 


Glu Asp Val 


Thr 


His 


Asp 


250 










255 


Pro 


Pro 


Ser 


Thr 


His 


Leu 


Gin 




265 










270 


Leu 


Lys 


Leu 


Pro 


He 


Thr 


Gly 




280 










285 


Pro 


Val 


Ser 


Gly 


Ser 


Leu 


Leu 




295 










300 


Pro 


Asn 


Asn 


Lys 


Tyr 


He 


Tyr 
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305 



irne 


Leu 


T 1 *a 

i le 


Lys 


x le 
320 


Tyr 


Pro 


Asp 


Arg 


Leu 


pit, 


lie 
335 




Asp 


t»iy 


AS n 


rne 


Lys 


lie 
350 




Lys 


Leu 


Leu 


Ala 


Ala 


Cjiy 
365 


Thr 


vjiy 


Asn 


Tyr 


Ala 


Leu 


Thr 
380 


Asp 


Tie 
116 


Met 


iriie 


file 


Asn 


Lys 
395 


inr 


olu 


Leu 


Q»J.U 


Lys 


Leu . 


Ala 
41U 


rne 


Lys 


vai 


Leu 


Ser 


Ala 


Pro 


i le 


ber 


lie 


Ser 


Pro 


Ala 


Leu 
440 


Leu 


Ser 


Lys 


Ser 


Lys 


Val 


Leu 
455 


Val 


Cys 


Glu 


Gin 


Gly Val 


Arg 


Leu 


Leu 










470 






Glu 


He 


His 


Ser 


Phe 


Thr 


Ala 



485 







■Jin 

sJ A. \J 










•J X -J 


Thr Gly 


Leu 


Phe 


Thr 


Pro 


Glu 


Leu 






325 










330 


Phe 


Val 


Ser 


Val 


Ser 


Ser 


Pro 


Glu 






340 










345 


Phe 


Gin 


Glu 


Leu 


Glu 


Asp 


Leu 


Phe 






355 










360 


Phe 


Thr 


Pro 


Met 


Val 


Lvs 


He 


Leu 






370 










375 


n w 


Ser 


Leu 


Arg 


Lys 


Val 


Lys 


Leu 






-J O J 










3 90 




Asp 


He 


He 


Trp 


Arg 


Ser 


Gin 






Ann 










405 

r± \J 




Lys 


Arg 


Leu Asp 


Val 


Glu 


Phe 






415 










420 


GlU 


Trp 


Asn 


Gly Lys 


Gin Gly 


His 






430 










435 


Glu 


Phe 


Leu 


Lys 


Arg 


Asn 


Leu 


Asp 






445 










450 


He 


Cys Gly 


Pro 


Val 


Pro 


Phe 


Thr 






460 










465 


His 


Asp 


Leu 


Asn 


Phe 


Ser 


Lys 


Asn 




475 










480 



<210> 10 
<211> 524 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No; 2050821CD1 

<400> 10 



Met 


Ser 


Leu 


Leu 


Ser 


Leu 


Pro 


Trp 


Leu 


Gly 


Leu Arg 


Pro Val 


Ala 


1 








5 










10 






15 


Met 


Ser 


Pro 


Trp 


Leu 


Leu 


Leu 


Leu 


Leu 


Val 


Val Gly 


Ser Trp 


Leu 








20 










25 






30 


Leu 


Ala 


Arg 


He 


Leu 


Ala 


Trp 


Thr 


Tyr 


Ala 


Phe Tyr 


Asn Asn 


Cys 








35 








40 






45 


Arg 


Arg 


Leu 


Gin 


Cys 


Phe 


Pro 


Gin 


Pro 


Pro 


Lys Arg 


Asn Trp 


Phe 






50 










55 






€0 


Trp 


Gly His 


Leu 


Gly 


Leu 


He 


Thr 


Pro 


Thr 


Glu Glu Gly Leu 


Lys 








65 










70 






75 


Asp 


Ser 


Thr 


Gin 


Met 


Ser 


Ala 


Thr 


Tyr 


Ser 


Gin Gly 


Phe Thr 


Val 








80 










85 






90 


Trp 


Leu Gly 


Pro 


He 


He 


Pro 


Phe 


He 


Val 


Leu Cys 


His Pro 


Asp 








95 










100 






105 


Thr 


He 


Arg 


Ser 


He 


Thr 


Asn 


Ala 


Ser 


Ala 


Ala He 


Ala Pro 


Lys 








110 










115 






120 


Asp 


Asn 


Leu 


Phe 


He 


Arg 


Phe 


Leu 


Lys 


Pro 


Trp Leu -Gly Glu Gly 








125 








130 






135 


He 


Leu 


Leu 


Ser 


Gly 
140 


Gly 


Asp 


Lys 


Trp 


Ser 
145 


Arg His 


Arg Arg 


Met 
150 


Leu 


Thr 


Pro 


Ala 


Phe 


His 


Phe 


Asn 


He 


Leu 


Lys Ser 


Tyr He 


Thr 










155 










160 




res 


He 


Phe 


Asn 


Lys 


Ser 


Ala 


Asn 


He 


Met 


Leu 


Asp Lys 


Trp <51n 


His 








170 










175 






180 


Leu 


Ala 


Ser 


Glu Gly 


Ser 


Ser 


Arg 


Leu 


Asp 


Met Phe 


Glu His 


He 










185 










190 






195 


Ser 


Leu 


Met 


Thr 


Leu 


Asp 


Ser 


Leu 


Gin 


Lys 


Cys He 


Phe Ser 


Phe 










200 








205 






210 
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Asp 


Ser 


His 


Cys 


Gin 


Glu 


Arg 


Leu 


Glu 


Leu 


Ser 


Ala 
o t n 


Leu 


Val 


Gin 


His 


Met 


Asp 


Phe 

1 A C 


Leu 


Tyr 


Phe 


His 


Arg 


Ala 


Cys 


Arg 


Leu 


He 


Arg 


Glu 


Arg 


Arg 
on r 


Arg 


Thr 


Phe 


Phe 


Lys 


Asp 


Lys 


Ala 


Lys 


Val 


Leu 


Leu 


Leu 


Ser 
one 


Lys 


Asp 


Glu 


Asp 


He 


Arg 


Ala 
*j *> n 


Glu 


Ala 


Asp 


Thr 


Thr 


Ala 


ser 

•5 7 c 


Gly 


Leu 


Arg 


His 


Pro 


Glu 


Tyr 

J jU 


Gin 


Glu 


Leu 


Leu 


Lys 


Asp 


Arg 

Job 


Asp 


Pro 


Ala 


Gin 


Leu 


Pro 


Phe 

O O A 


Leu 


Thr 


Leu 


His 


Pro 


Pro 


Ala 

O ft CT 

3 95 


Pro 


Phe 


He 


Val 


Leu 


Pro 


Asp 
410 


Gly 


Arg 


Leu 


He 


Asp 


He 


lie 
425 


Gly 


Val 


Asp 


Pro 


GlU 


Val 


Tyr 
440 


Asp 


Pro 


Lys 


Gly 


Arg 


Ser 


Pro 
455 


Leu 


Ala 


Arg 


Asn 


cys 


Tl o 


470 




Ala 


Val 


Leu 


Ala 


Leu 


Met 
485 


Leu 


Leu 


Thr 


Glu 


Pro 


Arg 


Arg 
500 


Lys 


Leu 


Gly 


Leu 


Trp 


Leu 


Arg 


Val 


Glu 



515 



Pro 


Ser 


pi 11 


Tyr 


Tift 

lie 


Ala 


Thr 


T 1 0 
















£t £t -J 


J.U 


Liys 




Ser 


Gin 


His 


He 








97 <; 










OAf\ 


ryi. 


T.011 

DC \jL 


Cor 


His 


Asp 


Gly Arg 








9^0 

^ -J \i 










£» -J -J 


Val 


His 




Phe 


Thr 


Asp 


Ala 


Val 
















270 


Leu 


Pro 


A. HI. 


Gin Gly 


He 


Asp 


Asp 
















w O J 


Ser 


Lys 


liii 


Leu 




Phe 


He 


Asp 






A 7 3 










J V V 


Glu Asp 




Lys 




Leu 


Ser 


Asp 
















7 1 R 

J X -J 


Asp 


Thr 


Phe 




rile 


Gly Gly 


His 






•IOC 










77 0 


Ser 


Trp 


v ex. JL 


Leu 


Tyr 


Asn 


Leu 


Ala 






7 A (\ 










J ft 3 


Arg 


Cys 


Arg 


Gin 


GlU 


Val 


Gin 








355 










7 fiO 
J O w 


Lys 


Glu 


He 


Glu 


Trp 


Asp 


Asp 






370 










07c 


Met 


Cys 


Val 


Lys 


Glu 


Ser 


Leu 


Arg 






385 










7 q n 


He 


Ser Arg 


Cys 


Cys 


Thr 


Gin 


Asp 






400 












Val 


He 


Pro 


Lys 


Gly 


He 


Thr 


Cys 






415 












His 


His 


Asn 


Pro 


Thr 


Val 


Trp 


Pro 






430 












Phe 


Arg 


Phe 


Asp 


Pro 


Glu 


Asn 


Ser 






445 










VI tl A 


Phe 


He 


Pro 


Phe 


Ser 


Ala Gly 


Pro 






460 










/ICC 


Phe 


Ala 


Met 


Ala 


Glu 


Met 


Lys 


vax 






475 












His 


Phe 


Arg 


Phe 


Leu 


Pro 


Asp 


His 






490 










495 


Glu 


Leu 


He 


Met 


Arg 


Ala 


Glu 


Gly 






505 










510 


Pro 


Leu 


Asn 


Val 


Gly 


Leu 


Gin 








520 













<210> 11 

<211> 144 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2408443CD1 

<400> 11 



Met 
1 


Val 


Thr 


Leu 


Tyr 
5 


Ser 


Ser 


Pro 


Ser 


Cys 
10 


Thr 


Ser 


Cys 


Arg 


Lys 
15 


Ala 


Lys 


Gin 


Trp 


Leu 
20 


Val 


Asp 


His 


Asn 


Leu 
25 


Pro 


Phe 


lie 


Glu 


Arg 
30 


Asn 


Leu 


Asn 


Lys 


Glu 
35 


Pro 


Leu 


Arg Ala Glu Asp Val 
40 


Lys 


Ala 


Met 
45 


Leu 


Arg 


Leu 


Thr 


Glu 
50 


Asp 


Gly 


Thr 


Glu 


Glu 
55 


Leu 


He 


Ser 


Thr 


Arg 
60 


Ser 


Lys 


He 


Phe 


Ser 
65 


Glu 


Leu 


Thr 


He 


Asp 
70 


Leu 


Asp 


Asp 


Met 


Ser 
75 


He 


Asn 


Lys 


Leu 


He 


Asp 


Leu 


He 


Val 


Met 


Tyr 


Pro 


Ser 


Leu 


Leu 
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80 








Lys 


Arg 


Pro 


He 


He 


Leu Asp 


Asp 










95 








Asn 


Asp 


Asp 


Glu 


He 
110 


Arg 


Arg 


Phe 


Arg 


Glu 


Leu 


He 


Arg 
125 


Ala 


Thr 


Phe 


Ala 


Lys 


Asp 


Leu 


Val 
140 


Val 


Glu 


Glu 



85 90 
Gin Arg Met Gin He Gly Tyr 

100 105 
Leu Pro Arg <3lu Val Arg Gin 

115 120 
Lys Ala Asp Phe Ala Glu Glu 

130 135 

Gly 



<210> 12 

<211> 373 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2508668CD1 

<400> 12 



Mec 


Trp 


Val Cys 


Ser 


Thr 


Leu 


Trp Arg Val 


Arg Thr 


Pro Ala 


Arg 


X 






5 










10 








15 


ply, 

Gin 


Trp 


Arg Gly Leu 


. Leu 


Pro 


Ala 


Ser -Gly 


Cys 


His 


Gly Pro 


Ala 








20 










25 








30 


Aid. 


O A V 


Ser Tyr 


Ser 


Ala 


Ser 


Ala 


Glu 


Pro 


Ala Arg Val Arg 


Ala 








35 










40 








45 


Leu 


val 


Tyr Gly His 


HIS 


bly 


Asp 


Pro 


Ala 


Lys 


Val 


Val Glu 


Leu 








50 










55 








60 


Lys 


Asn 


Leu Glu 


Leu 


Ala 


Aia 


Val 


Arg 


Gly 


Ser 


Asp Val Arg 


Val 








65 










7C 








75 


Lys 


Met 


Leu Ala 


Ala 


pro 


Tip 


Asn 


Pro 


Ser 


Asp 


He 


Asn Met 


He 








80 










85 








90 


Gin 


Glv 

vj - l y 


Asn Tyr 


Gly 


T .on 


IICU 


Pro 


Glu 


Leu 


Pro 


Ala 


Val Gly Gly 








95 










100 








105 


Asn 


Glu 


Gly Val 


Ala 


Gin 


Val 


Val 


Ala 


Val 


Gly 


Ser 


Asn Val 


Thr 








110 










115 








120 


Gly 


Leu 


Lys Pro 


Gly 


Asp 


Trp 


Val 


He 


Pro 


Ala Asn Ala Gly Leu 








125 










130 








135 


Gly 


Thr 


Trp Arg 


Thr 


Glu 


Ala 


Val 


Phe 


Ser 


Glu 


Glu 


Ala Leu 


He 








140 










145 








150 


Gin 


Val 


Pro Ser 


Asp 


He 


Pro 


Leu 


Gin 


Ser 


Ala 


Ala 


Thr Leu 


Gly 








155 










160 








165 


Val 


Asn 


Pro Cys 


Thr 


Ala 


Tyr 


Arg 


Met 


Leu 


Met 


Asp 


Phe Glu 


Gin 








170 










175 








180 


Leu 


Gin 


Pro Gly Asp 


Ser 


Val 


He 


Gin 


Asn 


Ala 


Ser 


Asn Ser 


Gly 








185 










190 








195 


Val 


Gly 


Gin Ala 


Val 


He 


Gin 


He 


Ala 


Ala 


Ala 


Leu 


Gly Leu 


Arg 








200 










205 








210 


Thr 


He 


Asn Val 


Val 


Arg 


Asp 


Arg 


Pro 


Asp 


He 


Gin 


Lys Leu 


Ser 








215 










220 








225 


Asp 


Arg 


Leu Lys 


Ser 


Leu 


Gly 


Ala 


Glu 


His 


Val 


He 


Thr <3lu 


<?lu 








230 










235 








240 


Glu 


Leu 


Arg Arg 


Pro 


Glu 


Met 


Lys 


Asn 


Phe 


Phe Lys 


Asp Met 


Pro 








245 










250 








255 


Gin 


Pro 


Arg Leu 


Ala 


Leu 


Asn 


Cys 


Val 


Gly 


Gly Lys 


Ser Ser 


Thr 








260 










2*5 








270 


Glu 


Leu 


Leu Arg Gin 


Leu 


Ala 


Arg Gly Gly 


Thr 


Met 


Val Thr 


Tyr 








275 










280 








285 


Gly 


Gly 


Met Ala 


Lys 


Gin 


Pro 


Val 


Val 


Ala 


Ser 


Val 


Ser Leu 


Leu 








290 










295 








300 


lie 


Phe 


Lys Asp 


Leu 


Lys 


Leu 


Arg Gly 


Phe 


Trp 


Leu 


Ser Gin 


Trp 








305 










310 








315 


Lys 


Lys 


Asp His 


Ser 


Pro 


Asp 


Gin 


Phe 


Lys -Glu Leu 


He Leu 


Thr 








320 










325 








330 
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Leu cys Asp Leu lie Arg Arg Gly Gin Leu Thr Ala Pro Ala Cys 
335 340 345 

Ser Gin Val Pro Leu Gin Asp Tyr Gin Ser Ala Leu Glu Ala Ser 
350 355 360 

Met Lys Pro Phe lie Ser Ser Lys Gin lie Leu Thr Met 
365 370 



<210> 13 

<211> 305 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc — feature 

<223> Incyte ID No: 253 683 0CD1 



<400> 13 



Met 


Ala 


Lys 


Phe 


Val 


He Ala Gly Arg Ala Asp 


Cys 


Pro 


Tyr 


Tyr 


1 






5 










10 










15 


Ala 


Lys 


Thr 


Glu 


Leu 


Val 


Ala Asp 


Tyr 


Leu 


Gin 


Lys 


Asn 


Leu 


Pro 








20 










25 










30 


Asp 


Phe 


Arg 


He 


His 


Lys 


He 


Thr 


Gin 


Arg 


Pro 


Glu 


Val 


Trp 


Glu 






35 










40 










45 


Asp 


Trp 


Leu 


Lys 


Asp 


Val 


Cys 


Glu 


Lys 


Asn 


Lys 


Trp 


ser 


His 


Lys 






50 










55 










60 


Asn 


Ser 


Pro 


He 


He 


Trp 


Arg 


GlU 


Leu 


Leu 


Asp 


Arg Gly Gly 


Lys 










65 






70 










75 


Gly 


Leu 


Leu 


Leu 


Gly 


Gly 


Tyr 


Asn 


Glu 


Phe 


Leu 


Glu 


His 


Ala 


Gin 








80 










85 










90 


Leu 


Tyr 


Tyr 


Asp 


Val 


Thr 


Ser 


Ser 


Met 


Thr 


Thr 


Glu 


Leu 


Met 


Met 




95 










100 










JLUD 


Val 


He 


Ala 


Gin 


Glu 


Asn 


Leu Gly Ala 


His 


He 


Glu 


Lys 


Glu 


Gin 










110 










115 










i on 


Glu 


Glu 


Glu 


Ala 


Leu 


Lys 


Thr 


Cys 


He 


Asn 


Pro 


Leu 


Gin 


Val 


Trp 










125 






130 










135 


He 


Thr 


Ser 


Ala 


Ser 


Ala 


Pro 


Ala 


Cys 


Tyr 


Asn 


Leu 


He 


Pro 


lie 










140 








145 










150 


Leu 


Thr 


Ser 


Gly 


Glu 


Val 


Phe Gly Met 


His 


Thr 


Glu 


He 


Ser 


He 








155 










160 










165 


Thr 


Leu 


Phe 


Asp 


Asn 


Lys 


Gin 


Ala 


Glu 


Glu 


His 


Leu 


Lys 


Ser 


Leu 








170 








175 










180 


Val 


Val 


Glu 


Thr 


Gin 


Asp 


Leu 


Ala 


Ser 


Pro 


Val 


Leu 


Arg 


Ser 


Val 










185 








190 










195 


Ser 


He 


Cys 


Thr 


Lys 


Val 


Glu 


Glu 


Ala 


Phe 


Arg 


Gin 


Ala 


His 


Val 








200 










205 










210 


He 


Val 


Val 


Leu 


Asp 


Asp 


Ser 


Thr 


Asn 


Lys 


Glu 


Val 


Phe 


Thr 


Leu 










215 








220 










225 


Glu Asp Cys 


Leu 


Arg 


Ser 


Arg 


Val 


Pro 


Leu 


Cys 


Arg 


Leu 


Tyr 


Gly 










230 










235 










240 


Tyr 


Leu 


He 


Glu 


Lys 


Asn 


Ala 


His 


Glu 


Ser 


Val 


Arg 


Val 


He 


Val 








245 










250 










255 


Gly Gly Arg 


Thr 


Phe 


Val 


Asn 


Leu 


Lys 


Thr 


Val 


Leu 


Leu 


Met 


Arg 










260 










265 










270 


Tyr 


Ala 


Pro Arg 


lie 


Ala 


His 


Asn 


He 


He 


Ala 


Val 


Ala 


Leu 


Gly 








275 










280 










285 


Val 


Glu Gly Glu Ala 


Lys 


Ala 


He 


Leu 


Ala 


Arg 


Lys 


Leu 


Lys 


Thr 










290 








295 










300 


Ala 


Pro 


Ser 


Cys 


Glu 
305 























<210> 14 
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WO 01/44448 

<211> 500 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2645179CD1 

<400> 14 

Met Glu Ala Ala Arg Pro Pro Pro Thr 

1 5 
Val Gly Gly Gly lie Ala Gly Val Thr 
20 

Thr His Phe Pro Ser Glu Asp lie Leu 
35 

Val lie Lys Ala Val Thr Asn Phe Lys 
50 

Glu Glu Phe Asp Val Glu Glu Gin Ser 

65" 

Arg Phe Pro Asn lie Lys Val lie Glu 
80 

Lys Ser Glu Glu His Cys lie Val Thr 
95 

Val Tyr Lys Lys Leu Cys Leu Cys Ala 
110 

He Cys Glu Gly Asn Pro Tyr Val Leu 
125 

Ser Ala Gin Glu Phe Gin Lys Gin Leu 
140 

Met He He Gly Asn Gly Gly He Ala 
155 

He Glu Gly Cys Glu Val lie Trp Ala 
170 

Gly Asn Thr Phe Phe Asp Ala Gly Ala 
185 

Lys Leu He Ala Glu Lys Ser Glu Ala 
200 

Thr Arg Tyr Thr Thr Glu Gly Arg Lys 
215 

Ser Lys Ala Asp Asn Val <31y Ser Ala 
230 

Glu Gly Leu Asn Leu Lys Gly Thr Lys 
245 

His Leu Glu Thr Met Cys Glu Val Lys 
260 

Glu Phe Arg He Leu Lys Lys Lys Ser 
. 275 

His Lys Ser Val Thr Ala Asp Thr Glu 
290 

Glu Leu Thr Asn Glu Lys lie Tyr Gly 
305 

Ala Thr Gly Val Thr Pro Asn Val Glu 
320 

Ser Phe Asp Leu Gly Glu Asp Gly Gly 
335 

Met His Thr Ser Leu Pro Asp He Tyr 
350 

Thr Thr Ser Trp Gin Leu Ser Pro Val 
365 

Trp Thr Gin Ala Arg Gin Met Gly Trp 
380 

Ala Ala Ala Ser Ser Gly Asp Ser He 
395 

Glu Leu Phe Ala His Val Thr Lys Phe 
410 

Leu Leu Gly Lys Tyr Asn Ala Gin Gly 
425 



PCT/US00/33158 



Ala Gly Lys 


Phe 


Val 


Vet J. 


10 












Cys 


Ala 


Glu 


Gin 


Leu 


ilia 


25 












Leu 


Val 


Thr 


Ala 


Ser 


Pro 


40 










A C 


Gin 


lie, 


Ser 


Lys 


He 


Leu 


55 










DU 


Ser 


Thr 


Met 


Leu Gly 


Lys 


70 










75 


Ser 


Gly Val 


Lys 


Gin 


Leu 


85 












Glu Asp Gly Asn Gin 


His 


100 










103 


Gly Ala Lys 


Pro 


Lys 


Leu 


115 










120 


Gly 


He Arg Asp 


Thr 


Asp 


130 










135 


Thr 


Lys 


Ala 


Lys 


Arg 


He 


145 










150 


Leu 


Glu 


Leu 


Val 


Tyr 


Glu 


160 








165 


He 


Lys 


Asp 


Lys 


Ala 


He 


175 










180 


Ala 


Glu 


Phe 


Leu 


Thr 


Ser 


190 










1 ft c 

195 


Lys 


He 


Ala 


His 


Lys 


Arg 


205 










210 


Lys 


Glu 


Ala 


Arg 


Ser 


Lys 


220. 










225 


Leu 


Gly 


Pro 


Asp 


Trp 


His 


235 










240 


Glu 


Phe 


Ser 


His 


Lys 


He 


250 










255 


Lys 


He 


Tyr Leu 


Gin 


Asp 


265 










270 


Phe 


Thr 


Phe 


Pro 


Arg 


Asp 


280 










285 


Met 


Trp 


Pro 


Val 


Tyr 


Val 


295 










300 


Cys 


Asp 


Phe 


He 


Val 


Ser 


310 










315 


Pro 


Phe 


Leu 


His 


Gly Asn 


325 










330 


Leu 


Lys 


Val 


Asp Asp 


His 


340 










345 


Ala Ala Gly Asp 


He 


Cys 


355 










360 


Trp 


Gin Gin 


Met 


Arg 


Leu 


370 










375 


Tyr 


Ala 


Ala 


Lys 


Cys 


Met 


385 










390 


Asp 


Met 


Asp 


Phe 


Ser 


Phe 


400 










405 


Phe 


Asn 


Tyr 


Lys 


Val 


Val 


415 










420 


Leu Gly 


Ser 


Asp 


His 


Glu 


430 










435 
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Leu Met Leu Arg Cys Thr Lys Gly Arg Glu Tyr lie Lys Val Val 

440 445 450 

Met Gin Asn Gly Arg Met Met Gly Ala Val Leu lie Gly Glu Thr 

455 460 465 

Asp Leu Glu Glu Thr Phe Glu Asn Leu lie Leu Asn Gin Met Asn 

470 475 480 

Leu Ser Ser Tyr Gly Glu Asp Leu Leu Asp Pro Asn lie Asp lie 

485 490 495 

Glu Asp Tyr Phe Asp 

500 



<210> 15 

<211> 369 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2754425CD1 

<400> 15 

Met Val Trp Ala Pro Leu Gly Pro Pro Arg Thr Asp Cys Leu Thr 
1 5 10 15 

Leu Leu His Thr Pro Ser Lys Asp Ser Pro Lys Met Ser Leu Glu 
20 25 30 

Trp Leu Val Ala Trp Ser Trp Ser Leu Asp Gly Leu Arg Asp Cys 
35 40 45 

lie Ala Thr Gly lie Gin Ser Val Arg Asp Cys Asp Thr Thr Ala 
50 55 60 

Val lie Thr Val Ala Cys Leu Leu Val Leu Phe Val Trp Tyr Cys 
65 70 75 

Tyr His Val Gly Arg Glu Gin Pro Arg Pro Tyr Val Ser Val Asn 
80 85 90 

Ser Leu Met Gin Ala Ala Asp Ala Asn Gly Leu Gin Asn Gly Tyr 
95 100 105 

Val Tyr Cys Gin Ser Pro Glu Cys Val Arg cys Thr His Asn Glu 

110 115 120 

Gly Leu Asn Gin Lys Leu Tyr His Asn Leu Gin Glu Tyr Ala Lys 

125 130 135 

Arg Tyr Ser Trp Ser Gly Met Gly Arg lie His Lys Gly He Arg 

140 145 150 

Glu Gin Gly Arg Tyr Leu Asn Ser Arg Pro Ser He Gin Lys Pro 

155 160 165 

Glu Val Phe Phe Leu Pro Asp Leu Pro Thr Thr Pro Tyr Phe Ser 

170 175 180 

Arg Asp Ala Gin Lys His Asp Val Glu Val Leu Glu Arg Asn Phe 

185 190 195 

Gin Thr He Leu Cys Glu Phe Glu Thr Leu Tyr Lys Ala Phe Ser 

200 205 210 

Asn Cys Ser Leu Pro Gin Gly Trp Lys Met Asn Ser Thr Pro Ser 

215 220 225 

Gly Glu Trp Phe Thr Phe Tyr Leu Val Asn Gin Gly Val Cys Val 

230 235 240 

Pro Arg Asn Cys Arg Lys Cys Pro Arg Thr Tyr Arg Leu Leu Gly 

245 250 255 

Ser Leu Arg Thr Cys He Gly Asn Asn Val Phe Gly Asn Ala Cys 

260 265 270 

He Ser Val Leu Ser Pro Gly Thr Val lie Thr Glu His Tyr Gly 

275 280 285 

Pro Thr Asn He Arg He Arg Cys His Leu Gly Leu Lys Thr Pro 

290 295 300 

Asn Gly cys Glu Leu Val Val Gly Gly Glu Pro Gin Cys Trp Ala 

305 310 315 

Glu Gly Arg Cys Leu Leu Phe Asp Asp Ser Phe Leu His Ala Ala 

14/40 
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320 325 330 

Phe His <31u Gly Ser Ala Glu Asp Gly Pro Arg Val Val Phe Met 

335 340 345 

Val Asp Leu Trp His Pro Asn Val Ala Ala Ala Glu Arg Gin Ala 

350 355 360 

Leu Asp Phe lie Phe Ala Pro Gly Arg 

365 



<210> 16 
<211> 145 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 2821526CD1 

<400> 16 ' 



Met 


Glu 


Leu 


Val 


Gin 


Val 


Leu 


Lys 


Arg 


Gly 


Leu 


Gin 


Gin 


He 


Thr 


1 








5 










10 










15 


Gly 


His 


Gly 


Gly 


Leu 


Arg 


Gly 


Tyr 


Leu 


Arg 


Val 


Phe 


Phe 


Arg 


Thr 










20 








25 










30 


Asn 


Asp 


Ala 


Lys 


Val 
35 


Gly 


Thr 


Leu 


Val 


Gly 
40 


Glu 


Asp 


Lys 


Tyr 


Gly 
45 


Asn 


Lys 


Tyr 


Tyr 


Glu 


Asp 


Asn 


Lys 


Gin 


Phe 


Phe 


Gly Arg 


His 


Arg 










50 










55 










60 


Trp 


Val 


Val 


Tyr 


Thr 


Thr 


Glu 


Met 


Asn Gly 


Lys 


Asn 


Thr 


Phe 


Trp 










65 










70 










75 


Asp 


Val 


Asp 


Gly 


Ser 
80 


Met 


Val 


Pro 


Pro 


Glu 
85 


Trp 


His 


Arg 


Trp 


Leu 
90 


His 


Ser 


Met 


Thr 


Asp 
95 


Asp 


Pro 


Pro 


Thr 


Thr 
100 


Lys 


Pro 


Leu 


Thr 


Ala 
105 


Arg 


Lys 


Phe 


He 


Trp 
110 


Thr 


Asn 


His 


Lys 


Phe 
115 


Asn 


Val 


Thr 


Gly 


Thr 
120 


Pro 


Glu 


Gin 


Tyr 


Val 
125 


Pro 


Tyr 


Ser 


Thr 


Thr 
130 


Arg 


Lys 


Lys 


He 


Gin 
135 


Glu 


Trp 


He 


Pro 


Pro 
140 


Ser 


Thr 


Pro 


Tyr 


Lys 
145 













<210> 17 
<211> 255 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 2876494CD1 

<400> 17 



Met 


Lys 


Val 


Leu 


Ala 


Thr 


Ser 


Phe 


Val 


Leu 


Gly 


Ser 


Leu Gly 


Leu 


1 








: 5 










10 








15 


Ala 


Phe 


Tyr 


Leu 


Pro 


Leu 


Val 


Val 


Thr 


Thr 


Pro 


Lys 


Thr Leu 


Ala 










20 










25 








30 


lie 


Pro 


Glu 


Lys 


Leu 


Gin 


Glu 


Ala 


Val 


Gly 


Lys 


Val 


He He 


Asn 










35 










40 








45 


Ala 


Thr 


Thr 


Cys 


Thr 


Val 


Thr 


Cys 


Gly 


Leu 


Gly 


Tyr 


Lys Glu 


Glu 










50 










55 








60 


Thr 


Val 


Cys 


Glu 


Val 


Gly 


Pro 


Asp Gly Val 


Arg 


Arg 


Lys Cys 


Gin 










65 










70 








75 


Thr 


Gin 


Arg 


Leu 


Glu 


Cys 


Leu 


Thr 


Asn 


Trp 


He 


Cys 


Gly Met 


Leu 










80 










85 








90 


His 


Phe 


Thr 


He 


Leu 


He 


Gly 


Lys 


Glu 


Phe 


Glu 


Leu 


Ser Cys 


Leu 
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Ser 


Ser 


Asp 


lie 


Leu 
110 


Glu 


Phe 


Trt> 


Arg 


Leu 


Ala 


125 


GlV 


Val 


Lys 


Pro 


Phe 


Gin 


140 


Asn 


Ser 


Gin 


Glu 


x Jr ± - 


Asp 


Ser 
155 


GlV 


Thr 


Lys 


Asn 


Leu 


Arg 


Leu 
170 


Val 




Lieu. 


Pro 


Pro 


Asn 


Leu 
185 


Val 


Asn 


Glu 


Asp 


Gin 


Lys 


Leu 
200 


He 


Asp 


Ser 


Tyr 


Ser 


Lys 


Pro 
215 


His 


His 


Ser 


Ala 


Leu Gly 


He 


Gly 


He 










230 






Leu 


Val 


Arg 


lie 


Val 


Leu 


Cys 



245 







100 








105 


GlV 


Gin 


Glu 


Ala Phe 


Arc 


Phe 


Thr 




115 








120 


He 


Ser 


Thr 


Asp Asp 


GlU 


Val 


Phe 






130 








135 


His 


Phe 


Val 


Lys Phe 


Lys 


Tvx 


Ala 






145 








150 


Tvx 


Arg 


Cys 


Asp Val 


Gin 


Leu 


Val 






160 








165 


Arg 


Leu 


Tyr 


Phe Gly 


Leu 


Arg 


Val 






175 








180 


Leu 


Asn 


Phe 


His Gin 


Ser 


Leu 


Thr 






190 








195 


Glu 


Gly Leu 


Glu Val 


Asn 


Leu 


Asp 






205 








210 


Pro 


Lys 


Trp 


Lys Lys 


Lys 


Val 


Ala 






220 








225 


Ala 


He 


Gly 


Val Val 


Gly 


Gly Val 






235 








240 


Ala 


Leu 


Arg 


Gly Gly Leu Gin 


Gin 






250 








255 



<210> 18 

<211> 246 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 3403225CD1 

<400> 18 



Met 


Leu 


Val 


Thr 


Leu 


Gly Leu Leu 


Thr 


Ser 


Phe Phe 


Ser 


Phe 


Leu 


1 








5 








10 








15 


Tyr 


Met 


Val 


Ala 


Pro 


Ser 


He Arg 


Lys 


Phe 


Phe Ala Gly Gly Val 










20 








25 








30 


Cys 


Arg 


Thr 


Asn 


Val 


Gin 


Leu Pro 


Gly Lys 


Val Val 


Val 


He 


Thr 










35 








40 








45 


Gly 


Ala 


Asn 


Thr 


Gly 
50 


He 


Gly Lys 


Glu 


Thr 
55 


Ala Arg 


Glu 


Leu 


Ala 
60 


Ser 


Arg 


Gly 


Ala 


Arg 


Val 


Tyr lie 


Ala 


Cys 


Arg Asp Val 


Leu 


Lys 










65 








70 








75 


Gly 


Glu 


Ser 


Ala 


Ala 
80 


Ser 


Glu He 


Arg 


Val 
85 


Asp Thr 


Lys 


Asn 


Ser 
90 


Gin 


Val 


Leu 


Val 


Arg 


Lys 


Leu Asp 


Leu 


Ser 


Asp Thr 


Lys 


Ser 


He 










95 






100 








105 


Arg 


Ala 


Phe 


Ala 


Glu 


Gly 


Phe Leu 


Ala 


Glu 


Glu Lys 


Gin 


Leu 


His 










110 






115 








120 


He 


Leu 


He 


Asn 


Asn 


Ala Gly Val 


Met 


Met 


Cys Pro 


Tyr 


Ser 


Lys 










125 








130 








135 


Thr 


Ala 


Asp 


Gly 


Phe 
140 


Glu 


Thr His 


Leu 


Gly 
145 


Val Asn 


His 


Leu 


Gly 
150 


Thr 


Gly Val 


Thr 


Thr 


Tyr 


Ala Val 


His 


Pro 


Gly Val 


Val 


Arg 


Ser 










155 






160 








165 


Glu 


Leu 


Val 


Arg 


His 
170 


Ser 


Ser Leu 


Leu 


Cys 
175 


Leu Leu 


Trp 


Arg 


Leu 
180 


Phe 


Ser 


Pro 


Phe 


Val 


Lys 


Thr Ala 


Arg 


Glu 


Gly Ala Gin 


Thr 


Ser 










185 






190 








195 


Leu 


His 


Cys 


Ala 


Leu 


Ala Glu Gly 


Leu 


Glu 


Pro Leu 


Ser 


Gly 


Lys 










200 








205 








210 


Tyr 


Phe 


Ser 


Asp 


Cys 
215 


Lys 


Arg Thr 


Trp 


Val 
220 


Ser Pro 


Arg 


Ala 


Arg 
225 


Asn 


Asn 


Lys 


Thr 


Ala 
230 


Glu 


Arg Leu 


Trp 


Asn 
235 


Val Ser 


Cys 


Glu 


Leu 
240 
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Leu Gly lie Arg Trp Glu 
245 



<210> 19 
<211> 467 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4163943CD1 

<400> 19 

Met Ala Ala Ala Ala Arg Ala Arg Val Ala Tyr Leu Leu Arg Gin 
1 5 10 15 

Leu Gin Arg Ala Ala Cys Gin Cys Pro Thr His Ser His Thr Tyr 
20 25 30 

Ser Gin Ala Pro Gly Leu Ser Pro Ser Gly Lys Thr Thr Asp Tyr 
35 40 . 45 

Ala Phe Glu Met Ala Val Ser Asn lie Arg Tyr Gly Ala Ala Val 
50 55 60 

Thr Lys Glu Val Gly Met Asp Leu Lys Asn Met Gly Ala Lys Asn 
65 70 75 

Val Cys Leu Met Thr Asp Lys Asn Leu Ser Lys Leu Pro Pro Val 
80 85 90 

Gin Val Ala Met Asp Ser Leu Val Lys Asn Gly He Pro Phe Thr 
95 100 105 

Val Tyr Asp Asn Val Arg Val Glu Pro Thr Asp Ser Ser Phe Met 

110 115 120 

Glu Ala He Glu Phe Ala Gin Lys Gly Ala Phe Asp Ala Tyr Val 

125 130 135 

Ala Val Gly Gly Gly Ser Thr Met Asp Thr Cys Lys Ala Ala Asn 

140 145 150 

Leu Tyr Ala Ser Ser Pro His Ser Asp Phe Leu Asp Tyr Val Ser 

155 160 165 

Ala Pro He Gly Lys Gly Lys Pro Val Ser Val Pro Leu Lys Pro 

170 175 180 

Leu lie Ala Val Pro Thr Thr Ser Gly Thr Gly Ser Glu Thr Thr 

185 190 195 

Gly Val Ala He Phe Asp Tyr Glu His Leu Lys Val Lys lie Gly 

200 205 210 

He Thr Ser Arg Ala He Lys Pro Thr Leu Gly Leu He Asp Pro 

215 220 225 

Leu His Thr Leu His Met Pro Ala Arg Val Val Ala Asn Ser Gly 

230 235 240 

Phe Asp Val Leu Cys His Ala Leu Glu Ser Tyr Thr Thr Leu Pro 

245 2S0 255 

Tyr His Leu Arg Ser Pro Cys Pro Ser Asn Pro He Thr Arg Pro 

260 265 270 

Ala Tyr Gin Gly Ser Asn Pro He Ser Asp He Trp Ala He His 

275 280 285 

Ala Leu Arg He Val Ala Lys Tyr Leu Lys Arg Ala Val Arg Asn 

290 295 300 

Pro Asp Asp Leu Glu Ala Arg Ser His Met His Leu Ala Ser Ala 

305 310 315 

Phe Ala Gly He Gly Phe Gly Asn Ala Gly Val His Leu Cys His 

320 325 330 

Gly Met Ser Tyr Pro He Ser Gly Leu Val Lys Met Tyr Lys Ala 

335 340 345 

Lys Asp Tyr Asn Val Asp His Pro Leu Val Pro His Gly Leu Ser 

350 355 360 

Val Val Leu Thr Ser Pro Ala Val Phe Thr Phe Thr Ala Gin Met 

365 370 375 

Phe Pro Glu Arg His Leu Glu Met Ala Glu He Leu Gly Ala Asp 
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380 385 390 

Thr Arg Thr Ala Arg lie Gin Asp Ala Gly Leu Val Leu Ala Asp 

395 400 405 

Thr Leu Arg Lys Phe Leu Phe Asp Leu Asp Val Asp Asp Gly Leu 

410 415 420 

Ala Ala Val Gly Tyr Ser Lys Ala Asp lie Pro Ala Leu Val Lys 

425 430 435 

Gly Thr Leu Pro Gin Glu Arg Val Thr Lys Leu Ala Pro Arg Pro 

440 445 450 
Gin Ser Glu Glu Asp Leu Ala Ala Leu Phe Glu Ala Ser Met Lys 

455 460 465 

Leu Tyr 



<210> 20 

<211> 317 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4293484CD1 

<400> 20 „ ■ 

Met Ala Asp Ser Ala Gin Ala Gin Lys Leu Val Tyr Leu Val Thr 
15 10 15 

Gly Gly Cys Gly Phe Leu Gly Glu His Val Val Arg Met Leu Leu 
20 25 30 

Gin Arg Glu Pro Arg Leu Gly Glu Leu Arg Val Phe Asp Gin His 
35 40 45 

Leu Gly Pro Trp Leu Glu Glu Leu Lys Thr Gly Thr Arg Asn Val 
50 55 60 

He Glu Ala Cys Val Gin Thr Gly Thr Arg Phe Leu Val Tyr Thr 
65 70 75 

Ser Ser Met Glu Val Val Gly Pro Asn Thr Lys Gly His Pro Phe 
80 85 90 

Tyr Arg Gly Asn Glu Asp Thr Pro Tyr Glu Ala Val His Arg His 
95 100 105 

Pro Tyr Pro Cys Ser Lys Ala Leu Ala Glu Trp Leu Val Leu Glu 
HO 115 120 

Ala Asn Gly Arg Lys Val Arg Gly Gly Leu Pro Leu Val Thr Cys 
125 130 135 

Ala Leu Arg Pro Thr Gly He Tyr Gly Glu Gly His Gin He Met 
140 145 150 

Arg Asp Phe Tyr Arg Gin Gly Leu Arg Leu Gly Gly Trp Leu Phe 
155 160 165 

Arg Ala He Pro Ala Ser Val Glu His Gly Arg Val Tyr Val Gly 
170 175 180 

Asn Val Ala Trp Met His Val Leu Ala Ala Arg Glu Leu Glu Gin 
185 190 195 

Arg Ala Thr Leu Met Gly Gly Gin Val Tyr Phe Cys Tyr Asp Gly 
200 205 210 

Ser Pro Tyr Arg Ser Tyr Glu Asp Phe Asn Met Glu Phe Leu Gly 
215 220 225 

Pro Cys Gly Leu Arg Leu Val Gly Ala Arg Pro Leu Leu Pro Tyr 
230 235 240 

Trp Leu Leu Val Phe Leu Ala Ala Leu Asn Ala Leu Leu Gin Trp 
245 250 255 

Leu Leu Arg. Pro Leu Val Leu Tyr Ala Pro Leu Leu Asn Pro Tyr 
260 265 270 

Thr Leu Ala Val Ala Asn Thr Thr Phe Thr Val Ser Thr Asp Lys 
275 280 285 

Ala Gin Arg His Phe Gly Tyr Glu Pro Leu Phe Ser Trp Glu Asp 
290 295 300 

Ser Arg Thr Arg Thr He Leu Trp Val Gin Ala Ala Thr Gly Ser 
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305 310 315 

Ala Gin 



<210> 21 

<211> 181 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 

<400> 21 



Met 


Phe 


Ser 


Ala 


He 












He 


Asn 


Asn 


Ala 


Gly 
20 


Ser 


Thr 


Ser Gly 


Trp 










35 


Leu 


Ser 


He 


Cys 


Thr 








50 


Asn 


Val 


Asp Asp 


Gly 










65 


His 


Arg 


Val 


Leu 


Pro 
80 


Lys 


Tyr 


Ala 


Val 


Thr 
95 


Arg 


Glu 


Ala 


Gin 


Thr 
110 


Val 


Val 


Glu 


Thr 


Gin 
125 


Glu 


Lys 


Ala 


Ala 


Ala 
140 


Glu 


Asp 


Val 


Ala 


Glu 
155 


His 


He 


Gin 


He 


Gly 
170 



Thr 



4440080CD1 



Arg 


Ser 


Gin 


His 


Ser 
i n 

1U 


Leu 


Ala 


Arg 


Pro 


Asp 


Lys 


Asp 


Met 


Phe 


Asn 
40 


Arg 


Glu 


Ala 


Tyr 


Gin 

55 


His 


He 


lie 


Asn 


lie 
70 


Leu 


Ser 


Val 


Thr 


His 
85 


Ala 


Leu 


Thr 


Glu 


Gly 
100 


His 


He 


Arg 


Ala 


Thr 
115 


Phe 


Ala 


Phe 


Lys 


Leu 








130 


Thr 


Tyr 


Glu 


Gin 


Met 
145 


Ala 


Val 


He 


Tyr 


Val 
160 


Asp 


He 


Gin 


Met 


Arg 
175 



Gly Val 


Asp 


He 


Cys 










15 


Thr 


Leu 


Leu 


Ser 


Gly 
3 0 


Val 


Asn 


Val 


Leu 


Ala 
45 


Ser 


Met 


Lys 


Glu 


Arg 
60 


Asn 


Ser 


Met 


Ser 


Gly 
75 


Phe 


Tyr 


Ser 


Ala 


Thr 
90 


Leu 


Arg 


Gin 


Glu 


Leu 
105 


Cys 


He 


Ser 


Pro 


Gly 
120 


His 


Asp 


Lys 


Asp 


Pro 
13 5 


Lys 


Cys 


Leu 


Lys 


Pro 
150 


Leu 


Ser 


Thr 


Pro 


Ala 
165 


Pro 


Thr 


Glu 


Gin 


Val 
180 



<210> 22 

<211> 360 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5495687CD1 

<400> 22 



Met 


Val Pro 


Ala 


Ala 


Gly Arg Arg 


Pro Pro 


Arg Val Met Arg 


Leu 


1 






5 






10 




15 


Leu 


Gly Trp Trp 


Gin 


Val 


Leu Leu 


Trp Val 


Leu Gly Leu Pro 


Val 








20 






25 




30 


Arg 


Gly Val 


Glu 


Val 


Ala 


Glu Glu 


Ser Gly 


Arg Leu Trp Ser 


Glu 








35 






40 




45 


Glu 


Gin Pro 


Ala 


His 


Pro 


Leu Gin 


Val Gly 


Ala Val Tyr Leu 


Gly 








50 






55 


60 


Glu 


Glu Glu 


Leu 


Leu 


His 


Asp Pro 


Met Gly Gin Asp Arg Ala 


Ala 








65 






70 




75 


Glu 


Glu Ala 


Asn 


Ala 


Val 


Leu -Gly Leu Asp 


Thr Gin -Gly Asp 


His 








80 






85 




90 
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Met 


Val 


Met 


Leu 


Ser 


Val 


He 


Pro 


Gly 


Glu 


Ala 


biU 


Asp 


Lys 


VcLX 










95 










100 












Ser 


Ser 


Glu 


Pro 


Ser 


Gly Val 


Thr 


Cys 


Gly Ala 




Gly Ala 












110 










115 










1 9 n 


Asp 


Ser 


Arg 


Cys 


Asn 


Val 


Arg 


GlU 


Ser 


Leu 


.pne 


s er 


Leu 


Asp 






125 




















J. J o 


Ala 


Gly 


Ala 


His 


Phe 


Pro 


Asp 


Arg 


pin 


Glu 


Glu 


Tyr 


Tyr 


mr 


r*i ii 








140 










14b 












Pro 


Glu 


Val 


■» 1 — 

Ala 


GlU 


Ser 


Asp 


TV 1 0 

a j. a 


nla 


Pro 


Thr 




Asp 


O a v 

oer 












155 










160 










IDj 


Asn 


Thr 


Glu 


Ser 


Leu 


Lys 


Ser 


Pro 


Lys 


Val 


Asn 


cys 


Glu 


GlU 












170 








175 










inn 

J. O \J 


Asn 


lie 


Thr 


Gly 


Leu 


Glu 


Asn 


Phe 


Thr 


Leu 


Lys 


Tl A 

i le 


Leu 


Asn 










185 










190 










1 Q C 


Ser 


Gin 


Asp 


Leu 


Met 


Asp 


Phe 


Leu 


Asn 


Pro 


Asn 


v»iy 


Ser 


Asp 


Cys 








200 








205 












Thr 


Leu 


Val 


Leu 


Phe 


Tyr 


Thr 


Pro 


Trp 


Cys 


Arg 


irne 


Ser 


Ala 


Ser 










215 








220 










one 


Leu 


Ala 


Pro 


His 


Phe 


Asn 


Ser 


Leu 


Pro 


Arg 


Ala 




Pro 


Ala 


Leu 










230 










235 












His 


Phe 


Leu 


Ala 


Leu 


Asp 


Ala 


Ser 


Gin 


His 


Ser 


Ser 


Leu 


Ser 


Thr 










245 








250 












Arg 


Phe 


Gly Thr 


Val 


Ala 


Val 


Pro 


Asn 


He 


Leu 


Leu 


Phe 


Gin 


ijiy 








260 










265 










Z f V 


Ala 


Lys 


Pro 


Met 


Ala Arg 


Phe 


Asn 


His 


Thr 


Asp 


Arg 


Thr 


Leu 


Pill 

bill 








275 










280 










o o c 


Thr 


Leu 


Lys 


He 


Phe 


He 


Phe 


Asn 


Gin 


Thr 


Gly 


He 


Glu 


Ala 


Lys 








290 










295 












Lys 


Asn 


Val 


Val 


Val 


Thr 


Gin 


Ala Asp 


Gin 


lie Gly Pro 


Leu 


Pro 








305 










310 












Ser 


Thr 


Leu 


He 


Lys 


Ser 


Val 


Asp 


Trp 


Leu 


Leu 


Val 


Phe 


Ser 


Leu 










320 










325 










330 


Phe 


Phe 


Leu 


He 


Ser 


Phe 


lie 


Met 


Tyr 


Ala 


Thr 


He 


Arg 


Thr 


Glu 










335 










340 










345 


Ser 


He 


Arg 


Trp 


Leu 


He 


Pro 


Gly Gin Glu Gin 


GlU 


His 


Val 


Glu 






350 










355 










360 



<210> 23 
<211> 476 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 5527735CD1 

<400> 23 



Met 


Ala 


Ser 


Pro 


Phe 


Ser 


Gly 


Ala 


Leu 


Gin 


Leu 


Thr 


Asp 


Leu 


Asp 


1 








5 








10 










15 


Asp 


Phe 


He 


Gly 


Pro 


Ser 


Gin 


Glu 


Cys 


He 


Lys 


Pro 


Val 




Val 






20 










25 










30 


Glu 


Lys 


Arg 


Ala 


Gly 


Ser 


Gly 


Val 


Ala 


Lys 


He 


Arg 


He 


Glu 


Asp 








35 










40 










45 


Asp 


Gly 


Ser 


Tyr 


Phe 


Gin 


lie 


Asn 


Gin Asp 


Gly Gly Thr Arg Arg 




50 










55 










60 


Leu 


Glu 


Lys 


Ala 


Lys 


Val 


Ser 


Leu 


Asn 


Asp 


Cys 


Leu 


Ala 


Cys 


Ser 








65 










70 










75 


Gly 


Cys 


He 


Thr 


Ser 


Ala 


Glu 


Thr 


Val 


Leu 


He 


Thr 


Gin 


Gin 


Ser 






80 










85 










90 


His 


Glu 


Glu 


Leu 


Lys 


Lys 


Val 


Leu 


Asp 


Ala 


Asn 


Lys 


Met 


Ala 


Ala 










95 








100 










105 


Pro 


Ser 


Gin 


Gin 


Arg 
110 


Leu 


Val 


Val 


Val 


Ser 
115 


Val 


Ser 


Pro 


Gin 


Ser 
120 


Arg 


Ala 


Ser 


Leu 


Ala 


Ala 


Arg 


Phe 


Gin 


Leu 


Asn 


Pro 


Thr 


Asp 


Thr 
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125 



Ala 


Arg 


Lys 


Leu 


Thr 


Ser 


Phe 


Phe 


Lys 










140 










Val 


Phe 


Asp 


Thr 


Ala 


Phe 


Ser 


Arg 


His 










155 










Gin 


Arg 


Glu 


Phe 


Val 


Arg 


Arg 


Phe 


Arg 










170 










Gin 


Ala 


Leu 


Pro 


Leu 


Leu 


Ala 


Ser 


Ala 










185 










Tvx 


Ala 


Glu 


Lys 


Thr 


His 


Gly 


Ser 


Phe 










200 










Thr 


Ala 


Ara 


Ser 


Pro 


Gin 


Gin 


Val 


Met 










215 










Phe 


Phe 


Ala 


Gin 


Gin 


Gin 


His 


Leu 


Thr 










230 










Val 


Thr 


Val 


Met 


Pro 


Cvs 


Tvx 
.* 


Asp 


Lys 










245 










Pro 


Asp 


Phe 


Phe 


Asn 


Gin 


Glu 


His 


Gin 








260 










Val 


Leu 


Thr 


Thr 


GlV 


Glu 


Val 


Phe 


Ara 










275 










Val 


Ser 


Leu 


Pro 


ASTD 


Leu 


GlU 


Pro 


Ala 










290 










Ser 


Glv 


Ala 


Ser 


Ala 


Glu 


GlU 


Pro 


Thr 










305 










Ser 


GlV 


Gly Tyr 


Leu 


Glu 


His 


Val 


Phe 










320 










Leu. 


Phe 


Gly 


He 


His 


Val 


Ala 


Glu 


Val 








335 










Asn 




Asp 


Phe 


Gin 


Glu 


Val 


Thr 


Leu 






350 










Leu 


Leu 


His 


Phe 


Ala 


Met 


Ala 


Tyr 


Gly 










365 










Leu 


Val 


Gin 


Arg 


Leu 


Lys 


Arg Gly Arg 










380 










Glu 


Val 


Met 


Ala 


Gys 


Pro 


Ser 


Gly 


Cys 










395 










Leu 


Gin 


Ala 


Pro 


Asp 


Arg 


Pro 


Ser 


Arg 










410 










Glu 


Arg 


Leu 


Tyr 


Gly 


Met 


Val 


Arg 


Ala 










425 










Pro 


Gly 


Val 


Gin 


Glu 


Leu 


Tyr 


Thr 


His 










440 










Ser 


Glu 


Cys 


Ala 


Gly 


Arg 


Leu 


Leu 


His 










455 










Glu 


Lys 


Ala 


Ser 


Thr 


Gly 


Leu Gly 


He 



470 



PCT/US00/33158 



130 










135 


Lys 


He 


Gly Val 


His 


Phe 


145 










150 


Phe 


Ser 


Leu 


Leu 


Glu 


Ser 


160 










165 


Gly. 


Gin 


Ala 


Asp 


Cys 


Arg 


175 










180 


Cys 


Pro 


Gly Trp 


He 


Cys 


190 










195 


lie 


Leu 


Pro 


His 


lie 


Ser 


205 










210 


Gly 


Ser 


Leu 


Val 


Lys 


Asp 


220 










225 


Pro 


Asp 


Lvs 


He 


Tyr 


His 


235 










240 


Lys 


Leu 


Glu 


Ala 


Ser 


Arg 


250 










255 


Thr 


Arg 


Asp 


Val 


Asp 


Cys 


265 










270 


Leu 


Leu 


Glu 


Glu 


Glu 


Gly 


280 










285 


Pro 


Leu 


Asp 


Ser 


Leu 


Cys 


295 










300 


Ser 


His 


Arg 


Gly Gly 


Gly 


310 










315 


Arc 


His 


Ala 


Ala 


Arg 


Glu 


325 










330 


Thr 


Tyr 


Lys 


Pro 


Leu 


Arg 


340 










345. 


Glu 


Lys 


Glu 


Gly 


Gin 


Val 


355 










360 


Phe 


Arg 


Asn 


He 


Gin 


Asn 


370 










375 


Cvs 

V, jf 


Pro 


Tyr 


His 


Tyr 


Val 


385 










390 


Leu 


Asn Gly Gly Gly 


Gin 


400 










405 


Glu 


Leu 


Leu 


Gin 


His 


Val 


415 










420 


Glu 


Ala 


Pro 


Glu 


Asp 


Ala 


430 










435 


Trp 


Leu 


Gin 


Gly 


Thr 


Asp 


445 










450 


Thr 


Gin 


Tyr 


His 


Ala 


Val 


460 








465 


Arg 


Trp 










475 













<210> 24 

<211> 621 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5540437CD1 

<400> 24 

Met Ser Gly Cys Gly Leu Phe Leu Arg Thr Thr Ala Ala Ala Arg 
1 5 10 15 

Ala Cys Arg Gly Leu Val Val Ser Thr Ala Asn Arg Arg Leu Leu 

20 25 30 

Arg Thr Ser Pro Pro Val Arg Ala Phe Ala Lys Glu Leu Phe Leu 

35 40 45 
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Gly 


Lys 


He 


Lys 


li 


T.vrc* 


Glu 


Val 


Phe 


Pro 


Phe 


Pro 


Glu 


Val 


Ser 


















55 










60 


Gin 


Asp 


Glu 


Leu 


noil 


Gl 11 


He 


Asn 


Gin 


Phe 


Leu Gly 


Pro 


Val 


Glu 


















70 










75 


Lys 


Phe 


Phe 


mi 




V7J. U 


Val 


Asp 


Ser 


Arg 


Lys 


He 


Asp 


Gin 


Glu 








ft n 










85 










90 


Gly 


Lys 


Tip 


Pro 


Asp 


Glu 


Thr 


Leu 


Glu 


Lys 


Leu 


Lys 


Ser 


Leu 


Glv 
















100 










105 


Leu 


Phe 


Glv 


Leu 


Gin 


Val 


Pro 


Glu 


Glu 


Tyr 


Gly Gly Leu 


Gly 


Phe 








110 










115 










120 


Ser 


Sen 


Thr 


Met 


lyi 


Ser 


Arg 


Leu Gly Glu 


Tie 

lie 


He 


Ser 


Met 


Asp 










125 








130 










135 


Gly 


OCl 


He 


1111 


Val 


Thr 


Leu 


Ala 


Ala 


His 


Gin 


Ala 


He 


Glv 


Leu 








140 










145 










150 


Lys 


Glv 

V?l Jf 


He 


lie 


Leu 


Ala 


Gly 


Thr 


Glu 


Glu 


Gin 


Lys 


Ala 


Lvs 


Tvr 








i ~j 










160 










165 


Leu 


Pro 


Lys 


Leu 


Ala 


Ser 


Gly Glu 


His 


He 


Ala 


Ala 


Phe 


CVS 


Leu 








1 / V/ 










175 










180 


X 111 


Gill 


Pro 


Ala 


OGl 


Gly 


Seir 


Asp 


Ala 


Ala 


Ser 


lie 


Arg 


Ser 


Arg 










1QC 
1 O J 






190 










195 


Ala 


Thr 


Leu 


Cor 


pi 1] 

1 LI 


TV en 


Lys 


Lys 


His 


Tyr 


He 


Leu 


Asn 


Gly 


Ser 
















205 










210 




Vul 


Trp 


lie 


mi 


Asn 


Gly 


Gly 


Leu Ala 


Asn 


lie 


Phe 


Thr 


Val 
















220 










225 


Phe 




r.ve 


J. Ill 


vl u. 


veil 


Val 


Asp 


Ser 


Asp 


Gly 


Ser 


Val 


Lys 


Asp 








OTA 










235 










240 




He 


Thr 


Ala 
r\ici 


lr lies 


He 


Val 


Glu 


Arg 


Asp 


Phe Gly Gly Val 


Thr 


















250 










255 


AS 11 


Gly 


Lys 


Pro 


Pin 


Asp 


Lys 


Leu 


Gly 


He 


Arg Gly 


Ser 


Asn 


Thr 








o £ n 






265 










270 


Cys 


Glu 


Val 


rli S 


rile 




Asn 


Thr 


Lys 


He 


Pro 


Val 


Glu 


Asn 


He 








A / z> 










280 










285 


L6U 


Gly Glu 


Val 


*j±y 


Asp 


Gly 


Phe 


Lys 


val 


Ala 


Met 


Asn 


He 


Leu 










*> on 






295 










300 


Asn 


Ser 


Gly 


Arg 


iriie 


Ser 


Met 


Gly 


Ser 


Val 


Val 


Ala Gly 


Leu 


Leu 






one 










310 










315 


Lys 


Arg 


Leu 


He 


Glu 


Met 


Thr 


Ala 


Glu 


Tyr 


Ala 


Cys 


Thr 


Arg 


Lys 
















325 










330 


bin 


Phe 


Asn 


Lys 


Arg 


Leu 


Ser 


Glu 


Phe Gly 


Leu 


He 


Gin 


Glu 


Lys 








"7 T 










340 










345 


Jr lit; 


Ala 


Leu 


l T lfcr L. 




Gl -n 
varlXl 


jjys 


Ala 


Tyr 


Val 


Met 


GlU 


Ser 


Met 


Thr 










"icn 
jjU 








355 










360 




Leu 


Thr 


Ala 


oiy 

JO J 


Mot* 


Leu 


Asp 


Gin 


Pro 
370 


Gly 


Phe 


Pro 


Asp 


Cys 
375 


Qor* 


He 


Glu 


AlCL 


AlGi 




Val 


Lys 


Val 


Phe 


Ser 


Ser 


Glu 


Ala 


Ala 










ft n 

J O U 








385 










390 


xrp 


Gin 


Cys 


Val 


Ser 


ul'U 


Ala 


Leu 


Gin 


lie 


Leu Gly Gly Leu 


Glv 










-J -7 -J 










400 










405 


J.yi 


Thr 


Arg 


2k en 

■no £J 


A Jf r 


Pro 


Tyr 


Glu 


Arg 


He 


Leu 


Arg 


Asp 


Thr 


Arg 


















415 










420 


lie 


Leu 


Leu 


He 


Phe 


Glu 


Gly 


Thr 


Asn 


Glu 


He 


Leu 


Arg 


Met 


Tvr 










*± 4a ~J 








430 










435 


lie 


Ala 


Leu 


Thr 


Gly 


Leu 


Gin 


His 


Ala Gly 


Arg 


He 


Leu 


Thr 


Thr 




















445 










450 


Arg 


He 


His 


Glu 


Leu 


Lys 


Gin 


Ala 


Lys 


Val 


Ser 


Thr 


Val 


Met 


Asp 










^± -j 








460 










465 


Thr 


Val 


Gly 


Arg 


Arg 


Leu 


Arg 


Asp 


Ser 


Leu 


Gly Arg 


Thr 


Val 


Asp 








47 0 








475 










480 


Leu 


Gly Leu 


Thr 


Glv 


Asn 


His 


Gly Val 


Val 


His 


Pro 


Ser 


Leu 


Ala 










•i 0} J 










490 










495 


Asp 


Ser 


Ala 


Asn 


Lys 


Phe 


Glu 


Glu 


Asn 


Thr 


Tyr 


Cys 


Phe 


Gly Arg 








500 










505 










510 


Thr 


Val 


Glu 


Thr 


Leu 


Leu 


Leu 


Arg 


Phe Gly 


Lys 


Thr 


He 


Met 


Glu 










515 








520 










525 


Glu 


Gin 


Leu 


Val 


Leu 


Lys 


Arg 


Val 


Ala 


Asn 


He 


Leu 


He 


Asn 


Leu 










530 








535 










540 


Tyr 


Gly Met 


Thr 


Ala 


Val 


Leu 


Ser 


Arg 


Ala 


Ser 


Arg 


Ser 


He 


Arg 
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He 


Gly 


Leu 


Arg 


Asn 


His 


Asp 


His 


Phe 


Cys 


Val 


Glu 


Ala 

O / 3 


Tyr 


Leu 


Gin 


Leu 


Asp 


Lys 


Tyr 


Ala 
590 


Pro 


Glu 


Asn 


Val 


Ser 


Gin 


Gin 


He 
605 


Leu 


Glu 


Lys 


Pro 


Leu 


Asp 


Arg 


Thr 
620 


Cys 







<210> 25 

<211> 245 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5596281CD1 

<400> 25 



Met 


Gly 


Arg 


Leu 


Asp 


Gly Lys 


Val 


1 








5 








Gin 


Gly 


He 


Gly 


Gin 
20 


Ala 


Ala 


Ala 


Ala 


Lys 


Val 


He 


Ala 

35 


Thr 


Asp 


He 


Leu 


GlU 


Lys 


Tyr 


Pro 
50 


Gly 


He 


Gin 


Lys 


Lys 


Lys 


Gin 


He 
65 


Asp 


Gin 


Phe 


Asp 


Val 


Leu 


Phe 


Asn 
80 


Val 


Ala 


Gly 


Leu 


Asp 


Cys 


Glu 


Glu 
95 


Lys 


Asp 


Trp 


Val 


Arg 


Ser 


Met 


Tyr 
110 


Leu 


Met 


He 


Leu 


Ala 


Gin 


Lys 


Ser 


Gly Asn 


He 










125 








Ser 


Ser 


Val 


Lys 


Gly 
140 


Val 


Val 


Asn 


Lys 


Ala 


Ala 


Val 


He 
155 


Gly 


Leu 


Thr 


He 


Gin 


Gin 


Gly 


He 
170 


Arg 


Cys 


Asn 


Asp 


Thr 


Pro 


Ser 


Leu 
185 


Gin 


Glu 


Arg 


GlU 


Glu 


Ala 


Arg 


Asn 
200 


Asp 


Phe 


Leu 


Phe 


Ala 


Thr 


Ala 


Glu 
215 


GlU 


He 


Ala 


Ser 


Asp 


Glu 


Ser 


Ala 
230 


Tyr 


Val 


Thr 


Gly 


Gly 


Trp 


Ser 


Leu 
245 









<210> 26 

<211> 159 

<212> PRT 

<213> Homo sapiens 





ccn 

J Jv 










-J -J 


Glu 


Val 


Leu 


Leu 


Ala 


Asn 


Thr 




JO J 












Asn 


Leu 


Phe 


Ser 


Leu 


Ser 


Gin 




580 










585 


Leu 


Asp 


Glu 


Gin 


He 


Lys 


Lys 




595 










600 


Arg 


Ala 


Tyr 


He 


Cys 


Ala 


His 




610 










615 



He 


He 


Leu 


Thr Ala Ala 


Ala 




10 








15 


Leu 


Ala 


Phe 


Ala Arg Glu 


Gly 




25 








30 


Asn 


Glu 


Ser 


Lys 


Leu Gin 


Glu 




40 








45 


Thr 


Arg 


Val 


Leu 


Asp Val 


Thr 




55 








60 


Ala 


Ser 


Glu 


Val 


Glu Arg 


Leu 




70 








75 


Phe 


Val 


His 


His 


Gly Thr 


Val 




85 








90 


Asp 


Phe 


Ser 


Met 


Asn Leu 


Asn 


100 








105 


Lys 


Ala, Phe Leu 


Pro Lys 


Met 




115 








120 


He 


Asn 


Met 


Ser 


Ser Val 


Ala 




130 








135 


Arg 


Cys 


Val 


Tyr 


Ser Thr 


Thr 




145 








150 


Lys 


Ser 


Val 


Ala 


Ala Asp 


Phe 




160 








165 


Cys 


Val 


Cys 


Pro Gly Thr 


Val 




175 








180 


He 


Gin 


Ala 


Arg 


Gly Asn 


Pro 




190 








195 


Lys 


Arg 


Gin 


Lys 


Thr Gly 


Arg 




205 








210 


Met 


Leu 


Cys 


Val 


Tyr Leu 


Ala 




220 








225 


Gly 


Asn 


Pro 


Val 


He He 


Asp 




235 








240 
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<220> 

<221> misc.feature 

<223> Incyte ID No: 5731013 

<400> 26 



Met 


Thr 


Ala 


Arg 


Gly Thr 


Pro 


Ser 


1 








5 






His. 


Asn 


Gly Leu 


Gly Arg 


Tyr 


Val 










20 






Phe 


Ser 


Val 


Ser 


Arg Asp Gly Ala 










35 






Phe 


Val 


Glu 


Arg 


Glu Val 


He 


Asp 










50 






Val 


Val 


He 


Tyr 


Val Asn 


Ser 


Arg 










65 






Val 


Ala 


Glu 


Tyr 


Leu Asn Gly Ala 










80 






Cys 


Lys 


Ser 


Val 


Glu Glu 


He 


Ser 








95 






Asp 


Gin 


Ser 


Gly 


Leu Asp 


Val 


lie 










110 






Thr 


Asp 


Asn 


Pro 


Ser He 


Gin 


Gly 








125 






Lys 


Pro 


Thr 


Thr 


Phe Arg 


Gly 


Leu 








140 






Pro 


Ala 


Pro 


Ala 


Gin Val 


Gin 


Ala 



155 



Arg 


Phe 


Leu Ala Ser 


Val 


Leu 


1U 






JLD 


Gin 


Gin 


Leu Gin Arg 


Leu 


Ser 












Ser 


Ser 


Arg Gly Ala Arg 


Oil! 




a n 








Phe 


Ala 


Arg Arg Asn 


Pro 






cc 






fin 


Pro 


Cys 


Cys Val Pro 


Arg 


vax 




/U 








Val 


Arg 


Glu Glu Ser 


He 


His 




85 






90 


Thr 


Leu 


Val Gin Lys 


Leu 


Ala 




100 






105 


Arg 


He 


Arg Lys Pro 


Phe 


His 


115 






120 


Gin 


Trp 


His Pro Phe 


Thr 


Asn 




130 






135 


Arg 


Pro 


Arg Glu Val 


Gin 


Asp 




145 






150 


Gin 











<210> 27 

<211> 291 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5731162CD1 

<400> 27 



Met 


Ala 


Cys 


Leu 


Ser 


Pro 


Ser 


Gin 


Leu 


Gin 


Lys 


Phe 


Gin 


Gin Asp 


1 






5 










10 








15 


Gly 


Phe 


Leu 


Val 


Leu Glu Gly 


Phe 


Leu 


Ser 


Ala 


Glu 


Glu 


Cys Val 








20 










25 








30 


Ala 


Met 


Gin 


Gin 


Arg 


lie Gly Glu 


He 


Val 


Ala 


Glu 


Met 


Asp Val 










35 










40 








45 


Pro 


Leu 


His 


Cys 


Arg 


Thr 


Glu 


Phe 


Ser 


Thr 


Gin 


Glu 


Glu 


Glu Gin 








50 










55 








60 


Leu 


Arg 


Ala 


Gin 


Gly 


Ser 


Thr 


Asp 


Tyr 


Phe 


Leu 


Ser 


Ser 


Gly Asp 








65 










70 








75 


Lys 


He 


Arg 


Phe 


Phe 


Phe 


Glu 


Lys 


Gly 


Val 


Phe 


Asp 


Glu 


Lys Gly 






80 










85 








90 


Asn 


Phe 


Leu 


Val 


Pro 


Pro 


Glu 


Lys 


Ser 


He 


Asn 


Lys 


He 


Gly His 










95 








100 








105 


Ala 


Leu 


His 


Ala 


His 


Asp 


Pro 


Val 


Phe 


Lys 


Ser 


He 


Thr 


His Ser 










110 








115 








120 


Phe 


Lys 


Val 


Gin 


Thr 


Leu 


Ala 


Arg 


Ser 


Leu 


Gly 


Leu 


Gin 


Met Pro 








125 










130 








135 


Val 


Val 


Val 


Gin 


Ser 


Met 


Tyr 


He 


Phe 


Lys 


Gin 


Pro 


His 


Phe Gly 










140 








145 








150 


Gly 


Glu 


Val 


Ser 


Pro 


His 


Gin 


Asp 


Ala 


Ser 


Phe 


Leu 


Tyr 


Thr Glu 








155 










160 








165 


Pro 


Leu Gly Arg 


Val 


Leu Gly Val 


Trp 


He 


Ala 


Val 


Glu 


Asp Ala 










170 










175 








180 


Thr 


Leu 


Glu 


Asn 


Gly 


Cys 


Leu 


Trp 


Phe 


He 


Pro 


Gly 


Ser 


His Thr 










185 








190 








195 
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Ser 


Gly 


Val 


Ser 


Arg 
200 


Pro 


Gly 


Thr 


Ser 


Phe 








215 


Leu 


Phe 


Val 


Pro 


Thr 
230 


His 


Gly 


Glu 


Val 


Val 








245 


Ser 


Arg 


Gin 


Ala 


Tyr 
260 


Thr 


Trp 


Ser 


Pro 


Glu 
275 


Phe 


Pro 


Gin 


Leu 


Tyr 
290 



Arg 


Met 


Val 


Arg 


Ala 
205 


Leu 


Gly 


Ser 


Glu 


Pro 
220 


Pro 


Val 


Gin 


Arg 


Gly 
235 


His 


Lys 


Ser 


Lys 


Gin 
250 


Thr 


Phe 


His 


Leu 


Met 
265 


Asn 


Trp 


Leu 


Gin 


Pro 
280 


Thr 











Pro 


Val 


Gly 


Ser 


Ala 










210 


Ala 


Arg 


Asp 


Asn 


Ser 










225 


Ala 


Leu 


Val 


Leu 


He 










240 


Asn 


Leu 


Ser 


Asp 


Arg 










255 


Glu 


Ala 


Ser 


Gly 


Thr 










270 


Thr 


Ala 


Glu 


Leu 


Pro 



285 



<210> 28 
<211> 1557 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 543496CB1 



<400> 28 

ctgagtgcga 

gtgcgtgcag 

acagacaccg 

tgtgccttgg 

ccaaagaaag 

cctggctctg 

tacagagcct 

gataaggata 

ctcactaagc 

attcgctata 

accctaggtg 

tccggagtac 

gttgtggcta 

cttccctctg 

tccacgtccc 

gccgttaact 

gccatgctgc 

cccccaagcg 

catgctgctg 

catccgcttg 

cacctcagta 

tatgaaacag 

aggctctatt 

acaaatgcag 

ctctcctaaa 

tattttcaag 



cggcgcaggt 
ctccccacag 
tgtatgacgt 
gatatgatat 
tactggagaa 
caacgcttct 
ttcggcgaat 
atttagatga 
agttggaggc 
cctggccttg 
atggcagcac 
ggcaggctgt 
ctctgcattt 
ggcctattgc 
atgaacatgc 
ctgccttttg 
agtatgctgt 
tagccagggt 
agtacgtcag 
caggacaggg 
cggcagcctt 
aaagacagcg 
ctaccagtgc 
tgtctccact 
gaaagattac 
atcttattta 



ctgcaccatg 
cggcccgctg 
ggtggtgtcg 
tcactttcat 
attgtcagaa 
cagtagtttt 
gcaggtgtgg 
catgggctat 
tgtgtctgac 
tccatttcct 
cttccagacc 
tggaatccag 
atcagaggcc 
tctgctcccg 
agcagagcta 
gag t gat get 
cagccttctg 
ggatgccaaa 
gectegggtg 
tgtcaacatg 
caatgggaag 
tcacaacact 
ctccccgctt 
caaagaacag 
gttgatgaaa 
atttaataaa 



gcggcccggc 
gtgtcctggc 
ggtggaggcc 
gacaagaaaa 
acttacagca 
ggtgcctggg 
gacgcctgct 
atcgtggaga 
egagtgaegg 
atggccgact 
aaattgttga 
aatgtgagct 
acagaaaaca 
ctctcagaca 
gttagcatgg 
gaccacacgg 
aagcccacta 
agecgagtte 
gcgctcattg 
ggctttgggg 
gacttaggtt 
gctcttctgg 
gtgttgctca 
attatggect 
aagaacatcc 
cttiactttac 



ttgtcagccg 
gcaggtggtc 
tggtgggcgc 
tcctgttgct 
acagggtcag 
accatatctg 
cagaggccct 
atgatgtcat 
ttctctacag 
ccagcccttg 
taggtgeaga 
ggaactatga 
aegtagectg 
ccttgagttc 
atgaggaaaa 
acttcatcga 
aggtctegge 
tgtttcctct 
gggatgeage 
atatctccag 
ccgtgagcca 
ctgetacaga 
ggacgtgggg 
ttgcaagcaa 
tgoccaggac 
attaaaaaaa 



atgegggget 
cggcgcctca 
tgccatggcc 
cgaagcaggt 
ctccatttcc 
caacatgaga 
gataatgttt 
catgeatget 
gagcaaagee 
ggttcatatt 
tggtcacaac 
ccagtctgct 
gcagagattt 
cttggtttgg 
atttgtggat 
cacagctggt 
tcgccagctg 
tgggttggga 
ccacagagtc 
cttggcccat 
cctcacaggt 
cttactaaaa 
cttgcaggcc 
atgagtactc 
ccatcataca 
aaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1557 



<210> 29 

<211> 1106 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



907607CB1 



<400> 29 

cccacgcgtc cgggttcccc ggcctctctt ggtcagggtg aegcagtage ctgcaaacct €0 
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cggcgcgtag gccaccgcac ttatccgcag caggaccgcc cgcagccggt agggtgggct 120 
cttcccagtg cccgcccagc taccggccag cctgcggctg cgcagatctt tcgtggttct 180 
gtcagggaga cccttaggca ctccggacta agatggcggc gacggccagg cggggctggg 240 
gagctgcggc tgttgccgcc gggctgcgca ggcggttctg tcatatgttg aagaatccat 300 
acaccattaa gaaacagcct ctgcatcagt ttgtacaaag accacttttc ccactacctg 360 
cagcctttta tcacccagtg agatacatgt ttattcaaac acaagatacc ccaaatccaa 420 
acagcttaaa gtttatacca ggaaaaccag ttcttgagac aaggaccatg gattttccca 480 
ccccagctgc agcatttcgc tcccctctgg ctaggcagtt atttaggatt gaaggagtaa 540 
aaagtgtctt ctttggacca gatttcatca ctgtcacaaa ggaaaatgaa gaattagact 600 
ggaatttact gaaaccagat atttatgcaa caatcatgga cttctttgca tctggcttac 660 
ccctggttac tgaggaaaca ccttcaggag aagcaggatc tgaagaagat gatgaagttg 720 
tggcaatgat taaggaattg ttagatacta gaatacggcc aactgtgcag gaagatggag 780 
gggatgtaat ctacaaaggc tttgaagatg gcattgtaca gctgaaactc cagggttctt 840 
gtaccagctg ccctagttca atcattactc tgaaaaatgg aattcagaac atgctgcagt 900 
tttatattcc ggaggtagaa ggcgtagaac aggttatgga tgatgaatca gatgaaaaag 960 
aagcaaactc accttaaaat aatctggatt ttctttgggc ataacagtca gacttgttga 1020 
taatatatat caagttttta ttattaatat gctgaggaac ttgaagatta ataaaatatg 1080 
ctcttcagag aatgatatat aaaaaa 1106 



<210> 30 

<211> 2180 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1290078CB1 

<400> 30 

gccccacaca ggccgcggcg gctggctcgg gcccctacgg tcccggcggc ggctggagga 60 
ggaagccagg cggctggcgg aggaggagag acggaggagg ccgagaccgg agcgccgctc 120 
gccgcagact tacttccccg gctcagcagg gaaaggttcc tagaaggtga gcgcggacgg 180 
tatgcaaagt tgtgaatcca gtggtgacag tgcggatgac cctctcagtc gcggcctacg 240 
gagaagggga cagcctcgtg tggtggtgat cggcgccggc ttggctggcc tggctgcagc 300 
caaagcactt cttgagcagg gtttcacgga tgtcactgtg cttgaggctt ccagccacat 3 60 
cggaggccgt gtgcagagtg tgaaacttgg acacgccacc tttgagctgg gagccacctg 420 
gatccatggc tcccatggga accctatcta tcatctagca gaagccaacg gcctcctgga 480 
agagacaacc gatggggaac gcagcgtggg ccgcatcagc ctctattcca agaatggcgt 540 
ggcctgctac cttaccaacc acggccgcag gatccccaag gacgtggttg aggaattcag 600 
cgatttatac aacgaggtct ataacttgac ccaggagttc ttccggcacg ataaaccagt 660 
caatgctgaa agtcaaaata gcgtgggggt gttcacccga gaggaggtgc gtaaccgcat 720 
caggaatgac cctgacgacc cagaggctac caagcgcctg aagctcgcca tgatccagca 780 
gtacctgaag gtggagagct gtgagagcag ctcacacagc atggacgagg tgtccctgag 840 
cgccttcggg gagtggaccg agatccccgg cgctcaccac atcatcccct cgggcttcat 900 
gcgggttgtg gagctgctgg cggagggcat ccctgcccac gtcatccagc tagggaaacc 9 60 
tgtccgctgc attcactggg accaggcctc agcccgcccc agaggccctg agattgagcc 1020 
ccggggtgag ggcgaccaca atcacgacac tggggagggt ggccagggtg gagaggagcc 1080 
ccgggggggc aggtgggatg aggatgagca gtggtcggtg gtggtggagt gcgaggactg 1140 
tgagctgatc ccggcggacc atgtgattgt gaccgtgtcg ctaggtgtgc taaagaggca 1200 
gtacaccagt ttcttccggc caggcctgcc cacagagaag gtggctgcca tccaccgcct 1260 
gggcattggc accaccgaca agatctttct ggaattcgag gagcccttct ggggccctga 1320 
gtgcaacagc ctacagtttg tgtgggagga cgaagcggag agccacaccc tcacctaccc 1380 
acctgagctc tggtaccgca agatctgcgg ctttgatgtc ctctacccgc ctgagcgcta 1440 
cggccatgtg ctgagcggct ggatctgcgg ggaggaggcc ctcgtcatgg agaagtgtga 1500 
tgacgaggca gtggccgaga tctgcacgga gatgctgcgt cagttcacag ggaaccccaa 1560 
cattccaaaa cctcggcgaa tcttgcgctc ggcctggggc agcaaccctt acttccgcgg 1620 
ctcctattca tacacgcagg tgggctccag cggggcggat gtggagaagc tggccaagcc 1680 
cctgccgtac acggagagct caaagacagc gcccatgcag gtgctgtttt ccggtgaggc 1740 
cacccaccgc aagtactatt ccaccaccca cggtgctctg ctgtccggcc agcgtgaggc 1800 
tgcccgcctc attgagatgt accgagacct cttccagcag gggacctgag ggctgtcctc 1860 
gctgctgaga agagccacta actcgtgacc tccagcctgc cccttgctgc cgtgtgctcc 1920 
tgccttcctg atcctctgta gaaaggattt ttatcttctg tagagctagc cgccctgact 1980 
gccttcagac ctggccctgt agcttttctt tttctccagg ctgggccgtg agcaggtggg 2040 
ccgttgagtt acctctgtgc tggatcccgt gcccccactt gcctaccctc tgtcctgcct 2100 
tgttattgta agtgccttca atactttgca ttttgggata ataaaaaagg ctccctcccc 2160 
tgcccctcaa aaaaaaaaaa 2180 
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<210> 31 
<211> 1311 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte lb No: 



1302741CB1 



<400> 31 
* ccgccgccca 
cgccccaatc 
ccagtctccg 
-gttgtcctcg 
acgctgcaga 
agtctggctt 
aagaatgcta 
aacctgaaga 
acgtccatcc 
ggtaggctta 
tggaataact 
gaggatgtca 
ttcctcatgc 
tcctattcag 
gtgaagccca 
caagtggtgt 
ttcactcage 
cctcctggaa 
tgtggagatg 
agcacgaaag 
ctgcacaaaa 
atgttatatg 



cccgagccta 
cttgcgtgtg 
acttccattt 
ccctgctgcg 
gctcgtgggt 
tcgtctacgg 
tgctggactt 
aaaattggag 
agaataacta 
tcaaatatgg 
tatacattgc 
ctcttagatc 
tccccgaaag 
gtgactttcg 
atatagccca 
ataaaagcca 
tgatgacatt 
aaaacagaga 
tggtgcgact 
gcatttttac 
tgtggaaagg 
tagatgaata 



tctgggctgc 
cttgcagtcc 
cccaccctaa 
ctgagtgtcc 
gaccttccgc 
ctccggggtg 
tgtgttcaca 
tcactactct 
tggcgctgga 
agttattagc 
tggacgactc 
agccctcgat 
cttttctgaa 
gatggtggtt 
ctttcgagag 
gcaaggctgg 
gcccaaaacc 
tgtggaagaa 
agggctttca 
tgctggcctg 
gtggctgagg 
aagtgtttga 



gtcttctcgc 
caccccacac 
accgcctacc 
cctgttagcc 
aagatcctgt 
taccgccagg 
gtagatgacc 
ttcctaaaag 
gtttactaca 
actaacgttc 
caaaaaccgg 
agaaatctga 
gaagacctct 
ggagaagata 
ctctatggca 
ctggagatag 
ttacagcaac 
actttattcc 
gcaatcgtga 
aagaagtcag 
aaaacatcct 
tcctttttga 



cgctgctctt 
tcagccttgt 
cggtgtctgt 
tcgaccccat 
ctcacttccc 
cagggccgag 
ctgtcgcatg 
ttttagggcc 
attcattgat 
tgattgaaga 
tgaaaattat 
agagtgctgt 
tcatagagat 
aaacaaaagt 
gcatactaca 
ataaaagccc 
agataaatca 
aagtggctca 
gaccgtctag 
tgatttatag 
gattttgctt 
caaaaaaaaa 



cgtggcccaa 
gtccctcgat 
tccccgcccg 
ggcgctgcag 
cgaggagctg 
t tcagaccag 
gcattcaaag 
caagattatc 
catgtgtaat 
tctcctcaac 
ctcagtgaac 
gaccgctgct 
tgccggtctc 
gttgaatatt 
ggaaaatcct 
agaaggacag 
tattatggac 
tgatcccgac 
tataagacag 
ttcactaaaa 
gcttttatat 



60 
120 
180 
240 
300 
3 60 
420 
480 
540 
600 
660 
720 
780 
840 
9 00 
9 60 
1020 
1080 
1140 
1200 
1260 
1311 



<210> 32 
<211> 921 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__f eature 
<223> Incyte ID No: 



1541028CB1 



<400> 32 

ggcccctgcg 

gacaactctc 

gggtggctga 

tttgtgccgc 

attttgcggg 

ggctggtccc 

tgccaaccca 

agctttatga 

cggctcgagg 

atccctcctg 

tgtctactgg 

gggcctcgag 

caggccctcg 

caggctggag 

tgcgggggtg 

tgaatcctca 



ccccacgtgg 
ccccctcccc 
gctgagaggg 
gggacgccca 
acttcggggc 
cacctctggc 
gcaactgccg 
gtgccaacac 
gccgctgtgg 
gctgtgagct 
tggacgactc 
tggtcttcat 
actttgtctt 
agacactgcg 
ggcgggggcg 
aaaaaaaaaa 



cgccctttcc 
cggcacccgg 
ctccgggaag 
gcggcacgac 
tgtgagctgg 
ccccgggtgc 
ccggtgcccg 
cttcggcaat 
gcccaccaat 
ggtggtcggc 
ttttctacac 
cgtggacctc 
cgccccagac 
ctcagggacg 
gaggatggga 
a 



gggcgggggg 
ctgatttgct 
gagtgacgtc 
gtggagctcc 
gacttctcag 
taccagctcc 
ggggcctatc 
gccggctttt 
gcccgggtca 
ggtgagcccc 
acagtggctc 
tggcacccca 
ccttgaagga 
gcttgatggt 
actggctagt 



ggtggggagg 
gtgccactgg 
aggacctgcc 
tggagagcag 
ggactacccc 
tgctgtacca 
gggcactgag 
ccgttctcct 
gatgccatct 
agtgctgggc 
acaatggctc 
acgtggcagg 
aggtgctccc 
agccaggacc 
gagcac tgaa 



cacctccagc 
gagggttcgg 
ttcagccccc 
cttccctgcc 
tccgcctcgg 
agcaggccgg 
ggggcttcga 
goctggggcc 
gggcctaaag 
tgaggggcac 
ccccgaagat 
ggctgagcgc 
ttcacacacc 
tcctctctac 
atataaattc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660. 
720 
780 
840 
900 
921 



<210> 33 
<211> 2032 
<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: 1597687CB1 

<400> 33 

agaaaccagg cggcgcgttc ccggtggcgg cgccctggac tcccgggccc gcgcatcccc 60 
gccagccttc cttaaggcgg atgggtggcc cccgagaccc cgtcggaccc atggtttcca 120 
gtgcagcgcg gagtgggcga tgccagcgtg ccaggagcca tgtctgacca ggacgtttgg 180 
aagatcatat ccatgccaga ggctcttgtg aggagatgag ttggtaaaga gagaggctgg 240 
gatgaagatg ctgccaggag tgggcgtgtt tgggactggc agctccgccc gagttctggt 3 00 
cccactgctg agggcagaag ggttcactgt tgaggccctg tgggggaaga ctgaggagga 360 
ggcgaagcag cttgctgagg agatgaacat cgccttctac accagccgga ctgatgacat 420 
cttgctgcat caagatgtgg atctggtgtg catcagcatc ccccctccac tcacccggca 480 
gatatecgtg aaggctctag gtattgggaa gaatgtggtt tgcgagaagg cagcaacatc 540 
ggtggatgcc ttccggatgg tgacagcctc gcgctactac ccgcagctca tgagcctggt 600 
agggaacgtg ctgcgcttcc tgcctgcctt cgtgcgcatg aaacagctga tttcggaaca 660 
ctatgtggga gcggtgatga tctgtgatgc ccgcatctac tcaggcagcc tgctgagccc 720 
cagctatggc tggatctgtg atgagctcat gggcggcggg ggcttgcaca ccatggggac 780 
ctacattgtg gacctgctga cccacctgac cggccggaga gccgagaagg tgcacgggct 840 
gctcaagaca ttcgtgaggc agaacgctgc catccgtggc atccggcacg tcactagcga 900 
tgacttctgt ttcttccaga tgctcatggg tgggggtgtg tgtagcacag tgacactcaa 960 
cttcaacatg ccaggcgcct ttgtgcatga agtcatggtg gtaggctctg caggacgcct 1020 
cgtcgcccgg ggagccgacc tctatgggca gaagaactct gccacgcaag aggagctgct 1080 
cttgagggac tcgctggcag tgggcgcagg actgcctgag caggggcccc aggatgtccc 1140 
gctgctgtac ctgaagggca tggtctacat ggtgcaggcc ttgcgccagt ccttccaggg 1200 
gcagggcgac cgccgcacct gggaccgcac ccctgtctcc atggccgcct ccttcgagga 1260 
tgggctgtac atgcagagcg tggtggatgc catcaagagg tcgagccgat ccggggagtg 1320 
ggaggctgtg gaggtgctga cggaggagcc cgacaccaac cagaacctgt gtgaggcact 1380 
tcagcggaac aacctatgag cctgcacctg ggctccttgc cacagggcag agggaccagg 1440 
gaggggaaca ggagccagac atgacaggga cttggcccta gcagagacag tgtctttcat 1500 
ttaatgagtc tgggtgaagc cagaggtggc tctgggactt cgccccctcc agtggctcat 1560 
ctgctcctca gacttgcagg gtggcaaatg ttcctgttgc caggttacca cggtgtgcac 1620 
agggcagagc tgccgccagg cctcctcatg gtgatgccag tgagtcggac caggaaaggc 1680 
tcaccctggg gagcccttca gcctgatccg cagtcctaac agggtgagaa agcacaagag 1740' 
gggccagctc ggctgaggcc tgaggagaaa tgatgagaag ttttgtttcg ttcttgggct 1800 
ggggcatctc aggaacagtg agggcaacaa gctgtccctc agggaccctc acctgccctc 1860 
ccatgtgacc aggcccttcc caggtgggga ctggggcttg ctttattaag tgagctcttg 1920 
ggcttttgaa gtaggacaga gggccctggt ttggcagaga ggaggcaagg gagggcttta 1980 
gaacatctga gaaatgttaa taaataattc agaaaataaa aaaaaaaaaa aa 2032 



<210> 34 

<211> 1134 

<212> DNA 

<213>, Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1690348CB1 

<400> 34 

cgtggacttc taagaaagcg ccatggcctg tgctgctgtt atgattcctg ggttgttgcg 60 
gtgctctgtt ggagccatcc gtattgaggc tgcgtcactg agattgacac tcagcacttt 120 
gcgccatctt actctaacca gcataatgaa atccaaaagg aaaactgatc acatggagag 180 
aactgcaagt gtccttcgac gggagattgt ggcagcagct aaggtgtgtg gagctgccag 240 
tgagtcaccg tcagtgaaga gcctccgctt gcttgttgct gatcaagact tttcctttaa 3 00 
agctggccag tgggttgatt tctttattcc aggagtctct gtggttggtg ggttttcaat 360 
atgctccagt cccagactgc tagaacaaga gagagtgata gaattggcag tgaaatatac 420 
gaaccaccct cctgccctct gggttcacaa tacgtgtaca cttgactgtg aagtggctgt 480 
gagagtgggt ggagagttct tctttgaccc tcagcctgcg gatgcctcta gaaacctcgt 540 
gttgattgca ggaggagtcg gaattaaccc tctgctttcc atcctgcggc acgcagcaga 600 
tctcctcaga gagcaggcaa acaaaagaaa tggatatgag ataggaacaa taaaactatt 660 
ctacagtgca aaaaatacca gcgaactcct gtttaagaaa aatatccttg atttagtaaa 720 
tgaatttcct gagaagattg catgcagttt gcatgttaca aaacagacta cacaaatcaa 780 
tgcggaactc aagccataca tcacggaagg aagaataacg gagaaggaga taagagatca 840 
tatttcaaaa gagactttgt tctatatttg tggcccacct ccaatgacag actttttctc 900 
caagcaactg gaaaacaacc atgtacccaa agaacacatt tgctttgaga agtggtggta 960 
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ggaggcagac aaaggcagaa aaaataaaga ggtgagatct actcaggaga gcrfccctgtcc 1020 
tttgtggcat gattaatttt ttttatctct acttgagttg tcttattttt taaggctata 1080 
aacttagtga ccagctggat aataaaagcc agctggcaga cttaaatgat aaac 1134 



<210> 35 

<211> 734 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> 661 

<223> a or g or c or t, unknown, or other 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



1865603CB1 



<400> 35 

gggaggaccc 

ccagtcaaga 

cctcccccat 

gacaccacca 

cggaaagcac 

gaacccaaag 

cctttggagg 

gtccccagaa 

aatctgattg 

gtcccccttg 

ctccaaagac 

nttgggacca 

ctcaatgaaa 



tgggcaaaga 
tgtggggcag 
gtcttatccg 
tgttttattc 
tgtgttttgg 
ccgctcaccc 
aaatgatcca 

caggggcaaa 
aggtgtccaa 
atgtcgccct 
tgaggactta 
aacctacatg 
aaaa 



cacctaccct 
gactttggag 
tagacttgac 
caagatcctg 
agaceagaaa 
agttcctggc 
gcacctcaag 
agggcctatc 
ctacatctcc 
ctcctttcct 
ggcacttcac 
tctgtatgtc 



gccatgctgc 
aaacagtcat 
cacatcgtga 
ggcatggagg 
tttaacctcc 
tccctggaca 
gcttgtgatg 
atgtccatct 
tcgtgatgga 
tcctgcaaac 
acatcctgct 
atcaaagttg 



gccatctgcc 
ggagggacag 
tgacggtgaa 
tcatgacttt 
acgaggtggg 
tatgtotgat 
tccctattga 
acttccgaga 
ggctggacct 
ccccacccag 
gaggggggac 
gcctaataaa 



ctccaggctg 
cagtcagacc 
gagcatcaaa 
taaggaagac 
aaaggaattt 
cacagaggtg 
ggaggggcca 
ccccgacaga 
cctccattct 
gcccagagat 
ccaagacagg 
tgcataacct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
734 



<210> 36 

<211> 2221 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc. feature 
<223> Incyte ID No: 



1976472CB1 



<400> 36 

gtcgggggct 

cggcgatccc 

tctttcccgg 

taaaacaggg 

cgggattaaa 

aagatgattg 

atcatcctgg 

ttgatcaggt 

gaatggccat 

atggcatgct 

caagctatga 

aggatatcaa 

caattattaa 

atttttctgt 

agaatacttc 

ggaaagatac 

atgatacgag 

ggcaacatgt 

ctgtatctgg 

tctacttttt 

agattggaga 

aagaattaga 



tggcctctgc 
cgggcagggc 
cccccaggtc 
cagaagcctt 
aggcaggtta 
ttggatatgc 
tggagaagat 
tcatcgttgg 
taaacctgct 
tcccaagagc 
ttggttccaa 
tttagactca 
ggattgttta 
gcgggttgtt 
ttgggacttt 
aggtttgtac 
gcttttctgt 
ttacctcaag 
ttccttactc 
gataaaaatc 
ttttgtttct 
agatctcttt 



ccggccacag 
ccggggccgg 
gcagcagcgt 
atggattgga 
attgaagtaa 
ataagaggtt 
gaactaatga 
gtcaattatg 
gttctgaaag 
caagtgacag 
acagactctt 
ataatagttg 
tatcttatac 
gagagtgtgg 
cttggccatc 
tacagaaagt 
ttgatgctgc 
ctacctatta 
tcagagttca 
tafccccactg 
gtaagcagtc 
ttgttggcag 



agccggagct 
ggtttgaaga 
gtcgcctccg 
ttcgactgac 
ctgaagaaga 
tcgtttataa 
gagcagcagg 
aatccatgct 
actatcgtga 
atacacttgc 
tagtcaccat 
atcatcagaa 
atattgggct 
gaaaaataga 
ccctgaagaa 
gocagttaat 
caccaagcac 
caggtacaga 
aggaaccagt 
gactcttcac 
ctgagggcaa 
ctggaacagg 



ggaggt^rctg 
tgctgaacgt 

gggggcgtac 

caaaagtgga 
acttaagaaa 
tgtcagccct 
atcagatggt 
gaaagaatgc 
ggaggaaaag 

caaagaaggt 
t<yccatatat 
tgattccttt 
aagccatgag 
gattgttcta 
tcataattca 
ttccaaggaa 
tcatcttcaa 
aatagtaaag 
tcttcccaac 
accagagctt 
ttttaaaata 
cttcacacca 



tcccgtctgg 
cccttcccag 
aaggtacctt 
aaggatctaa 
cacaacaaaa 
tatatggagt 
actgaaottt 
ctggttggca 
aaagtcttaa 
cctagttatc 
act aaac aga 
agagcagaaa 
gttcaggaag 
caaaaaaaag 
cttattccaa 
gatgttacfcc 
gtgcccattg 
ccatatacac 
aataaataca 
gatcgtcttc 
tccaagttcc 
atggttaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
9 00 
960 
1020 
1O80 
1140 
1200 
1260 
1320 
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tactgaatta 
ataaaacaga 
aaagactgga 
gacatatttc 
ttctcgtctg 
atctcaactt 
tcctttattc 
tttgtacata 
gttccttcag 
atatttgacc 
cacaaatttc 
gacatgataa 
caatacagat 
ttattaatgt 
tgtgtatggc 



tgctttgact 
agatgatata 
tgttgaattt 
accagctctt 
catttgtgga 
ttccaaaaat 
aactagttta 
acaaaaggtt 
tacaggtaac 
tggaaagtta 
cttgccattt 
agtaaatgat 
taaattttac 
taaaacatag 
attgatgcag 



gatataccca 
atttggagaa 
gttctctcag 
ctttctgaat 
ccagtgccat 
gagatccata 
tctaaatttg 
aactagaatc 
ttcttggctt 
atcatggcaa 
aaattatatc 
cacttcgatc 
cttgcactgt 
aatgttataa 
aatagaataa 



gtctcaggaa 
gccaattgga 
cacctatttc 
ttttgaaaag 
ttacagaaca 
gttttacagc 
tgattgctta 
cagccttcag 
tcttttgtac 
caaatacata 
actgttctac 
atgtttctat 
taactcagga 
cttattaagt 
aattatactt 



agtgaagctg 
gaaattagca 
tgaatggaat 
aaatttggac 
aggagtaagg 
ataatgaaga 
gggtttttta 
tttcttaaat 
cacaacttat 
caggattctt 
aataagcact 
gctgtaaaat 
aatgatcatt 
gatccaaaca 
aagttctttt 



atgttcttca 
tttaaagata 
ggcaaacagg 
aaatccaaag 
ttgctgcatg 
gctgtcattg 
agagaacatt 
gaaatcaaat 
tttactactg 
tgttatgaat 
tgtgttttat 
gtcttattag 
tatgtccttg 
tttttttgtg 
taaaaaaaaa 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2221 



<210> 37 

<211> 1706 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2050821CB1 



<400> 37 

agaagaggtt 

tggctgggcc 

tcctggctac 

ctccagtgtt 

actcctacag 

tttacggtat 

cggtctatca 

ctgaagccct 

cgtcggatgc 

aacaagagtg 

cgtctggaca 

ttcagctttg 

ctcagtgccc 

tacctctccc 

gacgctgtca 

aaagacaaag 

gaagatggga 

ggaggccatg 

ccagaatacc 

aaagagattg 

agcctgaggt 

ctcccagatg 

catcacaacc 

gagaacagca 

tgcatcgggc 

cacttccggt 

gccgagggcg 

gacccatcca 

aaaaaaaaac 



gtgtgggaca 
tcagaccggt 
tcgcccgcat 
tcccacagcc 
aggagggctt 
ggctgggtcc 
ccaatgcctc 
ggctgggaga 
tgacgcccgc 
caaacatcat 
tgtttgagca 
acagccattg 
ttgtagagaa 
atgacgggcg 
tccgggagcg 
ccaagtccaa 
aggcattgtc 
acaccacggc 
aggagcgctg 
aatgggacga 
tacatccccc 
gccgagtcat 
caactgtgtg 
aggggaggtc 
aggcgttcgc 
tcctgccaga 
ggctttggct 
cctgtttttt 
taaaaaaaaa 



agctgctccc 
ggcaatgtcc 
cctggcttgg 
cccaaaacgg 
gaaggactcg 
catcatcccc 
agctgccatt 
agggatactg 
cttccatttc 
gcttgacaag 
catcagcctc 
tcaggagagg 
aagaagccag 
gcgcttccac 
gcgtcgcacc 
gactttggat 
agatgaggat 
cagtggcctc 
ccgacaggag 
cctggcccag 
agctcccttc 
ccccaaaggc 
gccggatcct 
acctctggct 
catggcggag 
ccacactgag 
gcgggtggag 
tgcagattgt 
aaaaaa 



gacagaagga 
ccatggctac 
acctatgcct 
aactggtttt 
acccagatgt 
ttcatcgttt 
gcacccaagg 
ctgagtggcg 
aacatcctga 
tggcagcacc 
atgaccttgg 
cccagtgaat 
catatcctcc 
agggcctgcc 
ctccccactc 
ttcattgatg 
ataagagcag 
tcctgggtcc 
gtgcaagagc 
ctgcccttcc 
atctcccgat 
attacctgcc 
gaggtctacg 
tttattcctt 
atgaaagtgg 
ccccgcagga 
cccctgaatg 
catgaataaa 



tgtcgctgct 
tcctgctgct 
tctataacaa 
ggggtcacct 
cggccaccta 
tatgccaccc 
ataatctctt 
gtgacaagtg 
agtcctatat 
tggcctcaga 
acagtctaca 
atattgccac 
agcacatgga 
gcctggtgca 
agggtattga 
tgcttctgct 
aggctgacac 
tgtacaacct 
ttctgaagga 
tgaccatgtg 
gctgcaccca 
tcatcgatat 
accccttccg 
tctccgcagg 
tcctggcgtt 
agctggaatt 
taggcttgca 
acggtgctgt 



gagcctgccc 
ggttgtgggc 
ctgccgccgg 
gggcctgatc 
ttcccagggc 
tgacaccatc 
catcaggttc 
gagccgccac 
aacgatcttc 
gggcagcagt 
gaaatgcatc 
catcttggag 
ctttctgtat 
tgacttcaca 
tgattttttc 
gagcaaggat 
cttcatgttt 
tgcgaggcac 
ccgcgatcct 
cgtgaaggag 
ggacattgtt 
tataggggtc 
ctttgaccca 
gcccaggaac 
gatgctgctg 
gatcatgcgc 
gtgactttct 
caaaaaaaaa 



60 
120 
180 
240 
300 
3 60 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1706 



<210> 38 

<211> 549 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> incyte ID No: 2408443CB1 
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<400> 38 

cccacgcgtc cgtgtcatgt acttttgtta 
tcaaggaggg caacatttta tggttacatt 
taaagcaaaa caatggctgg ttgaccataa 
agaaccattg cgtgcggaag atgtcaaagc 
agaattaatt tcaacacgct caaaaatttt 
gtcaattaat aaattgattg acctcatcgt 
tattcttgac gatcagcgca tgcaaattgg 
accacgtgaa gttcgtcagc gagagttaat 
agaggcaaaa gatttagtgg ttgaagaagg 
tgacaaaag 



atatttttgc tataataaac ttagtatatt 60 
atactcgtct ccgagttgca cctcttgtcg 120 
tctcccattt attgaacgta atttaaataa 180 
aatgttacga ttgactgaag atgggacgga 240 
ttctgagttg acgattgact tagatgatat 300 
catgtatccg tctttactga agcggccaat 360 
gtacaatgat gatgaaattc gtcgcttttt 420 
tcgcgcaaca tttaaagctg acttcgcaga 480 
ctgagcacgt cagtcttttt ttctttacca 540 

549 



<210> 39 

<211> 1363 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2508668CB1 



<400> 39 

cgtcaggctc 

gaacccccgc 

cctcctccta 

acggggatcc 

cagatgtccg 

aaggaaacta 

aggtggtagc 

caaatgctgg 

aagttccgag 

cctacaggat 

catccaacag 

ccatcaatgt 

tgggggctga 

ttaaggacat 

agctgctgcg 

agcccgtcgt 

ggttgtccca 

tgtgcgatct 

aggactacca 

tcaccatgtg 

gctgggtgca 

gcctcttcct 

ggtatctaat 



tgtgttggtt 
ccggcagtgg 
ctccgcatcc 
agccaaggtc 
tgtgaagatg 
cggactcctt 
ggtgggcagc 
tttaggaacc 
tgacatccct 
gttgatggac 
cggagtgggg 
ggtocgagac 
gcatgtcatc 
gccccagcca 
gcagttagcg 
agcctctgtg 
gtggaagaag 
catccgccga 
gtctgccttg 
atcatcccaa 
ggcccctcag 
attaggagga 
aaagtctgaa 



ggagcgagca 
cgggggctgc 
gccgagcctg 
gtcgaactca 
ctggcggccc 
cctgaactgc 
aatgtgaccg 
tggcggaccg 
cttcagagcg 
ttcgagcaac 
caagcggtca 
agacctgata 
acagaagagg 
cggcttgctc 
cgtggaggaa 
agcctgctca 
gatcacagtc 
ggccagctca 
gaagcctcca 
aagagctgga 
ttggggctcc 
tggtgaagcc 
ctctcccttc 



tgtggigtctg 
tcccagcttc 
cccgggtccg 
agaacctgga 
ctatcaatcc 
ctgctgttgg 
ggctgaagcc 
aggctgtgtt 
ctgccaccct 
tgcagccagg 
tccagatogc 
tccagaagct 
agctaagaag 
tcaactgtgt 
ccatggtaac 
tttttaagga 
cagaccagtt 
cagcccctgc 
tgaagccctt 
gtgacatggg 
caccttcccc 
agccacggtt 
caaaaaaaaa 



cagtaccctg 
tggctgtcac 
ggcgcttgtc 
gctagctgct 
atctgacata 
agggaacgaa 
aggagactgg 
cagcgaggaa 
gggtgtcaat 
ggattctgtc 
cgcagccctg 
gagtgacaga 
gcccgaaatg 
tggtgggaaa 
ctatgggggg 
tctcaaactt 
caaggagctg 
ctgctcocag 
catatcttca 
aggggaggcg 
agactactgt 
ttccccaggg 
aaa 



tggcgggtgc 
ggacctgccg 
tatgggcacc 
gtgagaggat 
aatatgatcc 
ggtgttgcac 
gtgattocag 
gcactgatcc 
ccctgcacag 
atccagaatg 
ggcctaagaa 
ctgaagagtc 
aaaaacttct 
agctccacag 
atggccaagc 
cgaggctttt 
atcctcacac 
gtcccgctgc 
aagcagattc 
gatctgaggg 
tctcctcact 
ccagccttaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020' 
1080 
1140 
1200 
1260 
1320 
1363 



<210> 40 

<211> 1196 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 253683 0CB1 

<400> 40 

ggttgccagg caaccgcgaa ccgcggtccc tgcctctgag tctctcfccga ccatggccaa 60 

attcgtcatc. gcgggtagag cagattgttrc atattatgct aaaacagaac ttgtggcaga 120 

ctatttacaa aagaatcttc ctgattttcg gatacataaa atcacacaac ^gtcctgaggt 180 

ttgggaggat tggctaaaag atgtgtgtga aaagaataag tggagtcaca agaattcccc 240 

tatcatctgg agagagctgt tggatcgtgg aggaaagggt ttgcttttgg gaggatataa 3O0 

tgagttcctg gagcatgctc agctttacta tgatgtoacc tctagcatga cgactgaact 360 

gatgatggta attgctcaag agaacctggg ggcacatata gaaaaagagc aggaggaaga 420 

agccctgaaa acttgcatca accccttgca ggtx:tggatc accagt<jcct ctgctcctgc 480 

ctgctacaac ctaattccca tattgacgag tggcgaagtg tt-fcgggatgo atacagaaat 540 
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tagcataact 
gacccaagac 
ggccttccgc 
cactctggag 
agagaaaaat 
gaagacagtt 
gctgggggtg 
atgtgagtga 
agctaggaag 
aacagacata 
cttttgtgtc 



ctatttgaca 
ctggcatctc 
caggcccacg 
gactgcctcc 
gctcatgagt 
ttactcatga 
gaaggtgaag 
atgtctcttt 
aacactgtcc 
aagaacttat 
tggctttttc 



acaagcaggc 
ccgtcctgcg 
tcattgtggt 
gaagcagggt 
ctgtcagagt 
gatatgcccc 
cgaaagccat 
atgtttttgt 
cctctaggaa 
tgtgtgattt 
catcattata 



ggaagaacat 
cagtgtctcc 
gctggatgac 
gcctctctgc 
catcgtggga 
acgcattgca 
actggccaga 
ttttattttg 
cattctgagt 
gactactgtg 
aaaatcaata 



ctcaaaagcc 
atctgcacga 
agcaccaaca 
aggctctatg 
gggagaacct 
cacaacatta 
aaactgaaga 
aaacaacatg 
atctggaata 
caaacatttt 
aatatttaat 



ttgtggtgga 
aggtggagga 
aggaggtgtt 
ggtacctgat 
ttgtaaacct 
ttgctgtggc 
cagctccttc 
atttcaaagc 
ggaagtataa 
tttttactgg 
aagaaa 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1196 



<210> 41 
<211> 1926 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



2645179CB1 



<400> 41 

gcttcctctc 

gagtccggca 

ggcggcggca 

gatattctct 

tctaaaatat 

tttcccaaca 

attgtaacag 

aaaccaaagt 

gctcaggaat 

ggtattgcac 

gataaagcta 

ctcattgctg 

ggaaggaaaa 

ccagattggc 

cttgaaacta 

aaaaagtcct 

cctgtctatg 

acaggagtta 

gatggtggcc 

ggtgacatct 

acccaggcta 

gactctattg 

tataaggttg 

atgctgagat 

atgggagctg 

aaccaaatga 

gattattttg 

gaaaaatcac 

attacagaag 

atcagttcaa 

aaatcttaag 

ttggagcaaa 

ttcttt 



ctggagtcca 
gcatggaggc 
tcgcgggcgt 
tggtaacagc 
tggaagaatt 
ttaaggttat 
aagatggcaa 
tgatatgtga 
ttcagaaaca 
ttgagttagt 
tagggaatac 
aaaaatcaga 
aggaagctag 
atgaaggctt 
tgtgtgaagt 
tcacttttcc 
tggaattgac 
caccaaatgt 
tgaaagtgga 
gtactacatc 
gacagatggg 
acatggattt 
tactgctggg 
gtaccaaagg 
tcttaattgg 
atctttcatc 
actaaaaatg 
aagtcaataa 
tgataatgat 
agtttattta 
ttatttattt 
tttaggtaag 



gagtcccgct 
agcgcgccct 
cacttgtgcg 
ttctcctgtt 
cgatgttgaa 
agaatctggc 
tcagcacgta 
aggaaatcct 
gcttactaaa 
gtatgaaatt 
tttcttcgat 
ggctaaaatt 
aagcaaatct 
gaatcttaaa 
aaagaaaatc 
aagagaccat 
caatgaaaag 
agaacctttt 
tgatcatatg 
ctggcagctg 
atggtatgca 
cagctttgaa 
aaaatacaat 
acgagaatac 
tgaaaccgat 
atatggagaa 
gaatttcttc 
aatgaatgac 
attagtggaa 
tagatatatc 
ctgtgtttta 
ttatctactt 



gctccgccgc 
cccccgacgg 
gagcagttgg 
attaaagcag 
gaacaatcaa 
gtaaagcaac 
tataagaaac 
tatgtattag 
gctaaaagaa 
gaaggctgtg 
gcaggagcag 
gcacataaaa 
aaagcagata 
ggaacaaaag 
taccttcagg 
aagtcagtta 
atatatggct 
ctccatggta 
cacacatccc 
agcccagtct 
gcaaagtgca 
ctgtttgctc 
gcacagggct 
atcaaagtcg 
ttagaagaaa 
gatctgctag 
aggaatcata 
tgtattgagt 
aaatataaaa 
tttccaatac 
aacataaata 
agccaaatgt 



gatattcagt 
cagggaagtt 
ctactcactt 
ttacaaattt 
gtaccatgtt 
tgaagagtga 
tctgtctgtg 
gaatccgtga 
taatgatcat 
aagtgatttg 
ctgaattctt 
gaaccagata 
atgtaggaag 
agttttctca 
atgagtttag 
cagctgatac 
gcgatttcat 
acagttttga 
ttcctgatat 
ggcagcagat 
tggctgcagc 
atgtgacaaa 
taggttcaga 
tcatgcaaaa 
catttgaaaa 
atccaaatat 
taaagttcca 
taatgatgae 
acataaattc 
aacactgacc 
tgtttacttg 
actctagtag 



aaaccactgg 
cgtggtggtc 
tccatcggaa 
caagcagatt 
aggaaaacgc 
agaacactgc 
tgctggagct 
tacagacagt 
agggaacggt 
ggccattaaa 
gacttcaaag 
tacaactgaa 
tgcattggga 
taagattcac 
aattttgaag 
agagatgtgg 
tgtcagtgct 
tctaggagaa 
ctatgctgcc 
gaggctgtgg 
gagttcagga 
attttttaac 
tcatgaatta 
tggacgaatg 
cctaatctta 
tgatatagaa 
aatgacacca 
cacactgaaa 
taagtttgaa 
gcttagataa 
tgatttagct 
actagaacca 



60 
120 
180 
240- 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1926 



<210> 42 

<211> 1727 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: 



2754425CB1 
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PCT/US00/33158 



<400> 42 

agggaccgac 

cttttgccgg 

tggcagtatc 

cccacgctaa 

ttgcttcaca 

agctggtcgc 

tgcgacacca 

tatcacgtgg 

gccgatgcca 

tgcacccaca 

cgctactcct 

ctcaacagcc 

acgccctatt 

cagaccatcc 

caaggatgga 

cagggggttt 

agccttcgga 

cctgggactg 

ggtctgaaaa 

gaagggcgct 

gcagaggatg 

gccgaacggc 

tggagtcggc 

accggtgttg 

atatcatgtc 

ttccttagat 

acagcgcatt 

tcttctgttg 

tctgaatata 



ggcacttgat 
aaagtggagc 
tgaggatcgg 
tctgcatggt 
cgcccagtaa 
tggatggcct 
ccgctgtcat 
gcagggagca 
acgggctgca 
acgagggcct 
ggtccggcat 
ggccctccat 
tctcccggga 
tgtgtgagtt 
aaatgaacag 
gtgttcccag 
cctgtattgg 
tgataacgga 
ctccaaatgg 
gccttctctt 
gcccacgggt 
aggctcttga 
gagaagggcc 
tttccatgct 
gggtccctct 
tttttttttt 
cctttgattg 
ctacttgttt 
ataatgtaaa 



aatgtgacaa 
agtgggcacg 
tggcagccat 
gtgggcgccc 
ggactccccc 
gagggactgc 
cac tgtggcc 
gccccggccc 
gaatggctac 
caaccagaag 
gggccgcatc 
ccagaagccc 
cgcacagaaa 
tgagaccctc 
cacccccagc 
gaactgtagg 
gaacaatgtt 
gcactatgga 
ctgtgagctg 
tgatgactct 
ggttttcatg 
tttcatcttt 
gaggcggggc 
cagaaacctg 
ttcccttggt 
tccttccaat 
gtccttgagt 
ttttcagtgc 
acttcttgtg 



aaaccagcct 
gccaaccttg 
gcctccccct 
ttgggacccc 
aagatgtcgc 
atcgccaccg 
tgcctcctgg 
tacgtctccg 
gtgtactgcc 
ctgtaccaca 
cacaagggca 
gaggtcttct 
catgatgtgg 
tacaaagctt 
ggggagtggt 
aagtgcccac 
tttgggaacg 
cccaccaaca 
gtggtggggg 
ttcctgcatg 
gtggatttgt 
gctccgggac 
ctgggcagac 
cctcagcgga 
tattgtaaat 
catttgcttc 
gaccagagac 
tctgaaatag 
gtcatcttaa 



cttccacccc 
tcgttcatca 
gccccagccg 
cgaggactga 
tcgagtggct 
gcatccagtc 
tcctcttcgt 
tcaactccct 
agtcccctga 
acctgcagga 
tccgcgagca 
tcctgcccga 
aagtgctgga 
tctcaaactg 
tcacctttta 
ggacgtaccg 
cgtgcatctc 
tccgcatccg 
gagagcccca 
ctgcgttcca 
ggcatccaaa 
gatgagagta 
tgtggtccgg 
aagctcttat 
ggaaactttt 
agagac tcct 
ttagtgocct 
agtaactaaa 
aaaaaaa 



actgcagtga 
atgccgcccc 
ctc-cttcccc 
ttgtctgacc 
ggtggcctgg 
cgtgcgggac 
gtggtactgt 
catgcaggct 
gtgcgtgcgc 
gtacgccaag 
gggccggtac 
cctgcccacc 
acggaacttc 
cagcctcccg 
cttggtcaat 
cttgctcgga 
tgtgctgagc 
atgccattta 
gtgctgggca 
tgaaggttca 
cgtcgcagcg 
tttcccatgc 
tc-cagtccct 
tt^gggatttt 
cggcttgtat 
ttctggccta 
tgtaagtctg 
tggttatttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1727 



<210> 43 
<211> 611 
<212> DNA 

<213> Homo sapiens 

<220> . 

<221> misc_f eature 

<223> Incyte ID No: 2821526CB1 

<400> 43 

cccatccggt gggttttgct cggcgcaccc gggaggcggg gccagcgagg caagatggag 60 
ttagtgcagg tcctgaaacg cgggctgcag cagatcaccg gccacggcgg tctccgaggc 120 
tatctacggg tttttttcag gacaaatgat gcgaaggttg gtacattagt gggggaagac 180 
aaatatggaa acaaatacta tgaagacaac aagcaatttt ttggocgtca ccgatgggtt 240 
gtatatacta ctgaaatgaa tggcaaaaac acattctggg atgtggatgg aagcatggfcg 300 
cctcctgaat ggcatcgttg gcttcacagt atgactgatg atcctccaac aacaaaacca 360 
cttactgctc gtaaattcat ttggacgaac cataaattca acgtgactgg caccccagaa 420 
caatatgtac cttattctac cactagaaag aagattcagg agtggatcoc accttcaaca 480 
ccttacaagt aaagacaatg aagaacagtt gaaacatgca aaatatggag cttttcatgt 540 
aattactctt ttactgttta ccattcacta taattcacaa ttaaaattgt gtgactaaac 600 
aaaaaaaaaa a 611 



<210> 44 
<211> 1352 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2876494CB1 
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<400> 44 

gttttattga 

aatcaagtta 

ggcaaaacgc 

ggttggcctt 

agctgcaaga 

gccttggcta 

gtcagactca 

ttctcattgg 

aggaagcttt 

tcttcaaacc 

actctgggac 

tctattttgg 

ttactgagga 

agcctcacca 

ttggagtggt 

agcagtgaca 

ccaagctagc 

actcccacag 

cgcagtaacc 

gtacccagga 

tcattttcta 

gtgtattcat 

taattcacag 



caatacatgc 
tgatttatag 
ctaccgagac 
ctacctgcct 
agctgtgggg 
taaggaggag 
gcgcttagaa 
caaggaattt 
ccggttcacc 
ctttcaagcc 
atatcgctgt 
gttgagggtc 
tcagaagtta 
cccaaagtgg 
tggtggcgtg 
gcttcaagaa 
tttttagggg 
ccaaaagaat 
taacttcctt 
aactgcaagc 
ccctttatga 
gtagagggat 
tagctgtatg 



atcatatctt 
aggattcagc 
atgaaggttt 
ttggtggtga 
aaagttatca 
accgtctgtg 
tgtctgacca 
gagcttagct 
tggagacttg 
aactcccact 
gatgtgcagc 
cttcctccta 
atagatgagg 
aaaaagaagg 
ttggtgagga 
cttaacagcc 
agtgttccag 
gagtgggaga 
atgtcccatg 
attgcctgtg 
cttggatgcc 
gtattcagcc 
gagacctttt 



ttgactttga 
tggaatacct 
tagccactag 
ctacacctaa 
tcaatgccac 
aggtgggccc 
actggatctg 
gtctgagttc 
ctcgaggtgt 
ttgtgaagtt 
tggtaaaaaa 
acttggtgaa 
gattggaagt 
tggcgtcagc 
ttgtcctctg 
ttgctcctga 
ctgctggtag 
aatggagggg 
gatctcttcc 
ttcctgggaa 
tccccacctc 
tttttagtga 
tc 



aggatatctc 
tgtgggtgct 
ttttgtcctt 
aacactggcc 
aacctgtact 
tgatggagtg 
tgggatgctc. 
agacatcttg 
catctccact 
taaatatgct 
cttgagactt 
tctgaatttc 
taatctggac 
cttgggaata 
tgcgctaagg 
agaactggct 
tggatcagct 
acaatctcct 
tgatcttcce 
gagttctaag 
catttcccct 
acattttttt 



atgtcaaagg 
ggctgagggt 
gggagcctgg 
atccctgaga 
gtcacctgtg 
agaaggaaat 
catttcacca 
gagtttggac 
gacgatgagg 
caggagtatg 
gtcaagaggc 
catcagtcac 
agctactcca 
ggaattgcca 

gggggcctgc 

gcccaggaag 
tagagggaac 
gggagctatg 
tgcccattgg 
aagcttgcat 
cttctgagct 
tcaataaaag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1352 



<210> 45 

<211> 1458 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3403225CB1 



<400> 45 

ggacggagag 

tgagccagag 

ggaacgatgc 

gctccatcca 

ggcaaggtag 

ctcgctagcc 

gctgccagtg 

ctatccgaca 

ctccatattc 

ggctttgaaa 

cacccaggcg 

cggctcttct 

gccctggctg 

tgggtgtctc 

gage ttc tag 

tccatgccat 

ccagctggtg 

acacctgaca 

cttctctaag 

ggctgggcat 

agaggctcag 

caactcttgg 

aacacacaaa 

tcccatttca 

ttagagaaaa 



gagcagagaa 
ctggggttga 
tggtcacctt 
tcaggaagtt 
tggtgatcac 
gaggageccg 
aaatccgagt 
ccaaatctat 
tgatcaacaa 
cccacctggg 
tcgtccgctc 
ccccctttgt 
agggcctgga 
caagggcccg 
gaatccggtg 
aatgaacagg 
ctgcgaatcc 
ctcttgtgag 
acctggaaag 

gggggtggca 
ctgaggcaag 
ttgatctctt 
gatcctggct 
ateatgeaga 
taaagact 



gcagcagaag 
agctggagca 
gggactgetc 
ctttgctggt 
tggcgccaac 
agtctatatt 
ggatacaaag 
ccgagccttt 
tgcgggagta 
agtcaaccac 
tgagctggtc 
caagaeggea 
gcccctgagt 
aaataacaaa 
ggagtagctg 
gaccaaggag 
tgcctgctct 
actggcttat 
tcagcaaccc 
gaagagcccg 
aagagcacca 
tctttttaaa 
aaccctggcc 
tggtttcttt 



cagecaagag 
gcagcaaaag 
acctccttct 
ggagtgtgta 
aegggcattg 
gectgeagag 
aactcccagg 
gctgagggct 
atgatgtgtc 
ctgggcaccg 
cggcactcct 
egggaggggg 
ggcaagtact 
acagctgagc 
gtggaagagc 
aaggccaacc 
gatcctcttg 
ggcatgagtt 
tctgggggca 
agaaattggg 
tcactgccta 
aatatttgee 
tatttagatt 
ttgtaaagag 



ctggagccag 
caacagcagc 
tctcgttcct 
gaacaaatgt 
gcaaggagac 
atgtactgaa 
tgctggtgcg 
ttctggcaga 
catattccaa 
gggtcaccac 
ccctgctctg 
cgcagaccag 
tcagtgactg 
gcctatggaa 
tgcagcttta 
ctaaaggatt 
acccttctgg 
gtggacacct 
gcaggactgg 
tcagttccct 
tttctagggg 
accaccctgg 
ccttcctctc 
ttccgtttgc 



accaggaacc 
tacagaagtt 
gtatatggta 
gcagcttcct 
ggecagagag 

gggggagtct 

gaaat tggac 
ggaaaagcag 
gacagctgat 
etaegcagtg 
cctgctctgg 
cctgcactgc 
caagaggacc 
tgtcagctgt 
tcaggcccaa 
gtcctcttgg 
gaatgtttgc 
atagagtgtt 
gcagatccca 
catcagcacc 
ctatacactc 
agtctagacc 
acctggacct 
ctttcaattt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1458 



<210> 46 

<211> 1884 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: 4163 943CB1 

<400> 46 

ggactccaag cgccatggcc gctgccgccc gagcccgggt cgcgtacttg ctgaggcaac 60 
tgcaacgcgc agcgtgccag tgcccaactc attctcatac ttactxnccaa gcccctggac 120 
tttcaccttc tgggaaaaca acagattatg cctttgagat ggctgttttra aatattagat 180 
atggagcagc agttacaaag gaagtaggaa tggacctaaa aaacatgggt gctaaaaatg 240 
tgtgcttgat gacagacaag aacctctcca agctccctcc tgtgcaagta gctatggatt 300 
_ccctagtgaa gaatggcatc ccctttacgg tttatgataa tgtgagagtg gaaccaacgg 3 60 , 
attcaagctt catggaagct attgagtttg cccaaaaggg agcttttgat gcctatgttg 420 
ctgtcggtgg tggctctacc atggacacct gtaaggctgc taatctgtat gcatccagcc 480 
- ctcattctga tttcctagat tatgtcagtg cccccattgg caagggaaag cctgtgtctg 540 
^tgcctcttaa gcctctgatt gcagtgccaa ctacctcagg aaccgggagt gaaactactg 600 
-gggttgccat ttttgactat gaacacttga aagtaaaaat tggcatcact tcgagagcca 660 
tcaaacccac actgggactg attgatcctc tgcacaccct ccacatgcct gcccgagtgg 720 
tcgccaacag tggctttgat. gtgctttgcc atgccctgga gtcatacacc accctgccct 780 
aecacctgcg gagcccctgc ccttcaaatc ceatcacacg gcctgcgtac cagggcagca 840 
acccaatcag tgacatttgg gctatccacg cgctgcggat cgtggctaag tatctgaaga 900 < 
rgggctgtcag aaatcccgat gatcttgaag caaggtctca tatgcacttg gcaagtgctt 960 
■r ttgctggcat cggctttgga aatgctggtg ttcatctgtg ccatggaatg tcttacccaa 1020 
tttcaggttt agtgaagatg tataaagcaa aggattacaa tgtggatcac ccactgrgtgc 1080 
cccatggcct ttctgtggtg ctcacgtccc cagcggtgtt cactttcacg gcccagatgt 1140 
ttccagagcg acacctggag atggcagaaa tactgggagc cgacacccgc actgccagga 1200 
tccaagatgc agggctggtg ttggcagaca cgctccggaa attcttattc gatctggatg 1260 
ttgatgatgg cctagcagct gttggttact ccaaagctga tatccccgca ctagtgaaag 1320 
gaacgctgcc ccaggaaagg gteaccaagc ttgcaccccg tccccagtca gaagaggatc 1380 
tggctgctct gtttgaagct tcaatgaaac tgtattaatt gtcattttaa ctgaaagaat 1440 
taccgctggc cattgtagtg ctgagagcaa gagctgatct agctagggct ttgtcttttc 1500 
atctttgtgc ataacttacc tgttaccagt ataggtggga tatacattta tcttgcagga 1560 
aattccccaa agctcagagt ccagttcctt ccataaaaca ggctggacaa atgaccacta 1620 
tgttagaccc ccaggctcga cttcaggggt cagtgttcct gtcccaaacc ccacacagaa 1680 
tactctgcct ctgcttcatg tagcaaatga gcaaaaactc agtatctatc aaaagtgtaa 1740 
attatatttc ctatgcctag taattcactt catgtctaaa aatttatctg atagaaacac 1800 
tagcaccagt acatacagaa gcatggcaag gatgtttctg gcagcacttt tctaataata 1860 
aaagatttga aacaaaaaaa aaaa 1884 



<210> 47 

<211> 1400 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4293484CB1 

<400> 47 

ctccaggcca gtctgggcac cctgggatag cggctgcagc tcccaggatc agcfcctgccc €0 
tcccgccaaa cgccagcctc gtcaccgctc cagggcacct ccagcagtaa caggtggttg 120 
cagcaggtgg cagccagccc ctggatgagc caaggtctct tccccagcca ggcatggccg 180 
actctgcaca ggcccagaag ctggtgtacc tggtcacagg g^gctgtggc ttcctgggag 240 
agcacgtggt gcgaatgctg ctgcagcggg agccccggct cggggagctg cgggtctttg 300 
accaacacct gggtccctgg ctggaggagc tgaagacagg tacccggaac gtgatcgagg 360 
cttgtgtgca gaccggaaca cggttoctgg tctacaccag cagcatggaa gttgtggggc 420 
-ctaacaccaa aggtcacccc ttctacaggg gcaacgaaga caccccatac gaagcagtgc 430 
.acaggcaccc ctatccttgc agcaaggccc tggccgagtg gctggtcctg gaggocaacg 540 
ggaggaaggt ccgtgggggg ctgcccctgg tgacgtgtgc ccttcgtccc acgggcatxrt 600 
acggtgaagg ccaccagatc atgagggact tctaccgcca gggcctgcgc ctgggaggtt 660 
ggctcttccg ggccatcccg gcctctgtgg agcatggccg ggtctatgtg ggcaatgttg 720 
cctggatgca cgtgctggca gcccgggagc tggagcagcg ggcaaccotg atgggcggcc 780 
aggtatactt ctgctacgat ggatcaccct acaggagcta cgaggatttc aacatggagt 840 
tcctgggccc ctgcggactg cggctggtgg gcgcocgocc attgctgccc tactggctgc 900 
tggtgttcct ggctgccctc aatgccctgc tgcagtggct gctgcggcca ct^gtgctct 960 
acgcacccct gctgaacccc tacacgctgg ccgtggccaa caccaccttc accgtcagca 1020 
ccgacaaggc tcagcgccat ttcggctatg agcccctgtt ctcgtgggag gatagccgga 1080 
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cccgcaccat tctctgggta caggccgcta cgggttcagc 
gcctggaggc ccagatacag cacatccacc caggtcccga 
aagggacagc tgcattccag agcaggaggc agggctctgg 
tcgtagagcc ctccacattt tctttttctt ttttgagaca 
gactggagtg cagtggtgtg atcatagctc actgcaccct 
atcctcctgc ctcagcctcc 



ccagtgacgg 
gccctcacac 
ggccagaatg 
gggtcttgct 
caacctcctg 



PCTAJS00/33158 

tggggctggg 1140 
cctggacggg 1200 
gctgtccttg 1260 
ctgtcaccca 1320 
ggttcaagca 1380 
1400 



<210> 48 
<211> 1313 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4440080CB1 



<400> 48 

gaggctggct 

tcataggagc 

agatgtgacc 

cacagcggtg 

tcaggcagca 

tgcacacggg 

aacatcaata 

gccaccaagt 

cagacccaca 

aaactccacg 

aaacccgagg 

attggagaca 

cttccctccc 

gatcacggga 

tttttatatc 

gaggtggtgt 

ctttgtctgc 

atcttcttgc 

ttatatctgt 

tctccccctt 

ctggggtcat 

gtggatttca 



agggagccgt 

tggctgctga 

tatcaaatga 

tagacatctg 

ccagtggttg 

aagcctacca 

gcatgtctgg 

atgccgtcac 

tccgagccac 

acaaggaccc 

atgtggccga 

tccagatgag 

cacccttcat 

taccacttcc 

atcttgtcaa 

ccctaattgt 

tctcagtgtc 

acctcaacgt 

gttgttatcc 

atctatctcc 

ccctccactc 

tggtgatcat 



gggagtgagc 

atgtaagagt 

agaggacatc 

catcaacaat 

gaaggacatg 

gtccatgaag 

ccaccgagtg 

tgcgctgaca 

gtgcatctct 

tgagaaggca 

ggctgttatc 

gcccacggag 

ggcttgcctc 

tgtccacacc 

attgcttcag 

tttacttgtt 

ttccctttga 

ctgtggctca 

agggctccag 

ttctcggctc 

tgactctgac 

taaaaaagaa 



tgctcacccc 

gcaggctacc 

ctctccatgt 

gctggcttgg 

ttcaatgtga 

gagcggaatg 

ttacccctgt 

gagggactga 

ccaggtgtgg 

gctgccacct 

tacgtcctca 

caggtgacct 

ctgcctctgg 

ccgaccaggg 

ttgtaaatgt 

aacttgttct 

catgggaaag 

gggctggggt 

acttcctcct 

cccagcccag 

tatggcagca 

aaatcgcaac 



agtcagaccc 

ccgggacttt 

tctcagctat 

cccggcctga 

acgtgctggc 

tggacgatgg 

ctgtgaccca 

ggcaagagct 

tggagacaca 

atgagcaaat 

gcacccccgc 

agtgactgtg 

attttaggtg 

gctagaaaat 

gaaaaatggg 

tgtgcccctg 

gagttgtggc 

ggcagaggga 

ctgcctgccc 

tcttggcttc 

gaacaccagg 

caaaaaaaaa 



ctgttgggtt 

gatcccctac 

ccgttctcag 

caccctgctc 

cctcagcatc 

gcacatcatt 

cttctatagt 

tcgggaggcc 

attcgccttc 

gaagtgtctc 

acacatccag 

ggagctcctc 

ttgatttctg 

ttgtttgaga 

ctggggaaag 

ggcacttggc 

caaaatcccc 

ggccttcacc 

cactgcaccc 

ttgtcccctc 

gcctggccca 

aaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
13 IT 



<210> 49 

<211> 1459 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_ feature 

<223> Incyte ID No: 5495687CB1 

<400> 49 

gggccgcgcc cgcgctccca ggctctcctc 
tcgcgtagcc gtgcgccgat tgcctctcgg 
gaccgccccg cgtcatgcgg ctcctcggct 
ttcccgtccg cggcgtggag gttgcagagg 
ctgctcaccc tctccaggtg ggggctgtgt 
cgatgggcca ggacagggca gcagaagagg 
gcgatcacat ggtgatgctg tctgtgattc 
agcctagcgg cgtcacctgt ggtgctggag 
agagcctttt ctctctggat ggcgctggag 
acacagagcc agaagtggcg gaatctgacg 
aaagtctgaa atccccaaag gtgaactgtg 
tcactctgaa aattttaaat atgtcacagg 
gtgactgtac tctagtcctg ttttacaccc 
ctcactttaa ctctctgccc cgggcatttc 
ctcagcacag cagcctttct accaggtttg 



ccccagcctt cctccggctg gcagcacgac 60 
cctgggcaat ggtcccggct gccggtcgac 120 
ggtggcaagt attgctgtgg gtgctgggac 180 
aaagtggtcg cttatggtca gaggagcagc 240 
acctgggtga ggaggagctc ctgcatgacc 300 
ccaatgcggt gctggggctg gacacccaag 360 
ctggggaagc tgaggacaaa gtgagttcag 420 
gagcggagga ctcaaggtgc aacgtccgag 480 
cacacttccc tgacagagaa gaggagtatt 540 
cagccccgac agaggactcc aataacactg 600 
aggagagaaa cattacagga ttagaaaatt 660 
accttatgga ttttctgaac ccaaacggta 720 
cgtggtgccg cttttctgcc agtttggccc 780 
cagctcttca ctttttggca ctggatgcat 840 
gcaccgtagc tgttcctaat attttattat 900 



36/40 



BNSDOC1D* <WO 0144448A2 I > 



WO 01/44448 



PCT/US00/33158 



ttcaaggagc taaaccaatg gccagattta atcatacaga tcgaacactg gaaacactga 960 

aaatcttcat ttttaatcag acaggtatag aagccaagaa gaatgtggtg gtaactcaag 1020 

ccgaccaaat aggccctctt cccagcactt tgataaaaag tgtggactgg ttgcttgtat 1080 

tttccttatt ctttttaatt agttttatta tgtatgctac cattcgaact gagagtattc 1140 

ggtggctaat tccaggacaa gagcaggaac atgtggagta gtgatggtct gaaagaagtt 1200 

ggaaagagga acttcaatcc ttcgtttcag aaattagtgc tacagtttca tacattttct 1260 

ccagtgacgt gttgacttga aacttcaggc agattaaaag aatcatttgt tgaacaactg 1320 

aatgtataaa aaaattataa actggtgttt taactagtat tgcaataagc aaatgcaaaa 1380 
atattcaata gatgcactat tcttgttttt actgcatgaa cgtaattcag tatttggaaa 1440 
gtaatccagt ttgaaatgg 1459 



<210> 50 

<211> 2101 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 55277 3 5CB1 

<400> 50 

cggctcgagg cggccgtcat ggcgtcgccc ttcagcgggg cgctgcagct gacggacctg 60 
gatgacttca tcgggccgtc tcaggagtgc atcaagcctg tcaaagtgga aaaaagggcg 120 
ggaagtggcg tggccaagat tcgcattgaa gatgacggga gctacttcca aattaaccaa 180 
gacggcggga cccggaggct ggagaaggcc aaggtctcgc taaacgactg cctggcgtgc 240 
agcggctgca tcacctccgc agagaccgtg cttatcaccc agcagagcca cgaggagctg 300 
aagaaggttc tagatgctaa caagatggcg gcacccagtc agcagaggct ggttgtagtt 360 
tcggtctcac cacagtctag agcatcgctg gctgcacggt ttcagctgaa tcctacagat 420 
actgccagga aattaacctc attctttaaa aaaatagggg tgcacttcgt cttcgacacc 480 
gccttctcaa ggcacttcag cctcctggag agccagcgag agtttgtgcg gcgattccga 540 
ggacaggccg actgcagaca ggcactgccc ctgctggcct ctgcctgccc aggctggatc 600 
tgctatgccg agaagactca cggcagcttc atcctccccc acatcagcac cgcccggtcc 660 
ccgcagcagg tcatgggctc cctggtcaag gacttcttcg cccagcagca gcacttgacc 720 
cctgacaaga tctaccacgt cacagtgatg ccctgctatg acaaaaagct ggaagcctcc 780 
agacccgact ttttcaacca ggagcaccag acacgggatg tggactgtgt cctcaeaaca 840 
ggagaagttt t<:aggttgct ggaggaagag ggcgtctccc tccccgacct ggaaccagcc 900 
cctctggaca gcctgtgcag cggtgcctct gcagaggagc ccaccagcca tcggggaggg 960 
ggctcggggg gctacctgga gcatgtgttc cggcacgcgg cccgagagct ctttggaatc 1020 
catgtggctg aggttaccta caaacccctg aggaacaaag acttccagga ggtgacactg 1080* 
gagaaggagg gccaggtgct gctgcacttc gcaatggcgt acggcttccg caacatccag 1140 
aacctggtgc agaggctcaa acgagggcgc tgcccctacc actacgtgga ggtcatggcc 1200 
tgcccctcag gctgcctgaa cggcgggggc cagctccagg ccccagacag gcccagcaga 1260 
gagctcctcc agcacgtgga gagactgtac ggcatggtcc gggctgaggc gcccgaggac 1320 
gcgcctgggg ttcaggagct gtacacacac tggctgcagg gcacggactc ggagtgtgca 1380 
ggtcgcttgc tgcatacgca gtaccacgcc gtggagaagg ccagcactgg -cctgggcatc 1440 
cggtggtagg ggctgcagga ccaggactcc caggaggccg tgtccatgtg tgacagcaga 1500 
accacatgcc ccaagacccc agggcttccc ccaaaattct gagtgagctg cagggtgtgc 1560 
tgggacccga gtaggagcta ggactagcca ggacccgcag ccgcctcgtc acctccagtt 1620 
gggtgcctct gggttcccac tggctctgcc caggtggggt ggggtggccc aggcagcaga 1680 
aggttccctg aggtcocaga gcctgttccg ttggccctgg gccgaggccc acaggtgctg 1740 
cccttgctgc tgctggtcgg gcacccaagt gcgtgagggg cttcagcctg tcecggggtt 1800 
gcctgaggca gagcaagacg ggttctcacc cctgact-tct ggaggcttcc cttgaagctc 1860 
tgtgcaaaag gtgggagaca gagctggacc tgcaggggtg gtcccgccac aaccctgcgt 1920 
gtggaccctg gcaggggggg ggtgccaggc ccctggaaag caggggttac cgttacgagg 1980 
ctgtggtccg gggcaagcca agtacgaagc agcagccatc gcgggctgca tcatccccca 2040 
gccaggtccc caccaggcct gtctcccagc gtttgtctaa taaacgcacc cctcgtaaaa 2100 



<210> 51 

<211> 2440 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
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<223> Incyte ID No: 5540437CB1 
<400> 51 

gtgtccctgc ggcgctaaga aggggagact 
tgagcggctg cgggctcttc ctgcgcacca 
tggtctctac cgcgaaccgg cggctactgc 
aagagctttt cctaggcaaa atcaagaaga 
aagatgaact taatgaaatc aatcagttct 
aggtggactc ccgaaaaatt gaccaggaag 
tgaagagcct agggcttttt gggctgcaag 
ccaacaccat gtactcaaga ctaggggaga 
ccctggcagc gcaccaggct attggcctca 
agaaagccaa atacttgcct aaactggcgt 
cggagccagc cagtgggagc gatgcagcct 
acaagaagca ctacatcctc aatggctcca 
atatttttac tgtgtttgca aagactgagg 
aaatcacagc attcatagta gaaagagact 
ataaattagg cattcggggc tccaacactt 
ctgtggaaaa catccttgga gaggtcggag 
acagcggccg gttcagcatg ggcagcgtcg 
tgactgctga gtacgcctgc acaaggaaac 
tgattcagga gaaatttgca ctgatggctc 
acctcacagc agggatgctg gaccaacctg 
tggtgaaggt gtt cage tec gaggccgcct 
tegggggett gggctacaca agggactatc 
tcctcctcat cttcgaggga accaatgaga 
tgcagcatgc cggccgcatc ctgactacca 
gcacagtcat ggataccgtt ggceggagge 
tggggctgac aggcaaccat ggagttgtgc 
ttgaggagaa cacctactgc ttcggccgga 
agaccatcat ggaggagcag ctggtactga 
atggcatgac ggccgtgctg tcgcgggcca 
acgaccacga ggttctcttg gccaacacct 
tcagcctctc tcagctggac aagtatgetc 
tgtcccagca gatccttgag aagegagect 
gctgaggcag gggacagtgt cccctgctac 
atgactgtta ctcttttttc agaaggtgtt 
ccgtctgcac ctgaagggtt gtcgcctggc 
caggcagtgc tctctaacag gaccatcaca 
gaattgctga cccctggaac tggegggtat 
actggggctt atgctgctgc ctccagggtg 
gatagecatt tctgctcaac cacacattct 
teat teat tt aaactagaag cagaggcact 
gaatataaaa tgtcacaatc tgtgtactgt 



gaggctgagg ctggggaaca tegggcagea 60 
cggctgcggc tcgtgcctgc eggggtctgg 120 
gcaccagccc gectgtaega gctttcgcca 180 
aagaagtttt cccatttcca gaagttagcc 240 
tgggacccgt ggaaaaattc ttcactgaag 3 00 
ggaaaatccc agatgaaact ttggagaaat 360 
tcccagaaga atatggtggc ctgggcttct 420 
tcatcagcat ggatgggtcc atcactgtga 480 
aggggatcat cttggctggc actgaggagc 540 
ceggggagea cattgcagcc ttctgcctca 600 
caatceggag cagagccaca ctaagtgaag 660 
aggtctggat tactaatgga ggactggcca 720 
tegttgatte tgatggatca gtgaaagaca 780 
ttggtggagt cactaatggg aaacccgaag 840 
gtgaagtcca ttttgaaaac accaagatac 900 
atgggtttaa ggtggccatg aacatcctca 960 
tggctgggct gctcaagaga ttgattgaaa 1020 
agtttaacaa gaggctcagt gaatttggat 1080 
agaaggctta cgtcatggag agtatgacct 1140 
gctttcccga ctgctccatc gaggcageca 1200 
ggcagtgtgt gagtgaggcg ctgcagatcc 1260 
cgtacgagcg catactgegt gacacccgca 1320 
ttctceggat gtacatcgcc ctgacgggtc 1380 
ggatccatga gcttaaacag gccaaagtga 1440 
ttegggaetc cctgggccga actgtggacc 1500 
accctagtct tgcggacagt gecaacaagt 1560 
ccgtggagac actgctgctc cgctttggca 1620 
agcgggtggc caacatcctc atcaacctgt 1680 
gccgctccat ccgcattggg ctccgcaacc 1740 
tctgcgtgga agcttacttg cagaatctct 1800 
cagaaaacct agatgagcag attaagaaag 1860 
atatctgtgc ccaccctctg gacaggacat 1920 
cgcccgcccc tacccatggc ccgttgctgg 1980 
gggattatca caggttaagc cttttgttcc 2040 
ctgggagagc ctcttccagg ttttgacctg 2100. 
gcttctgaac tgagceggag agagagaatg 2160*: 
tctggtcatt gaggagacac catagtggaa 2220 
tgaggtgggt ggggacctgt gtcaggtgtg 2280 
ctaagaaaca gcttgaaagc tctgtctggg 2340 
taaaacatgt accaggaacc atttaacaaa 2400 
taaaaaaaaa 2440 



<210> 52 

<211> 1072 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5596281CB1 

<400> 52 

cggacggtgg gtccagacaa aggttgtctc 
acgccatcag ctcgctgctt aaaattaaac 
gaaagteatc atectgaegg ccgctgctca 
tgcaagagaa ggtgccaaag tcatagccac 
ggaaaagtac cegggtatte aaactcgtgt 
teagtttgee agtgaagttg agagacttga 
tcatggaact gtcctggatt gtgaggagaa 
gcgcagcatg tacctgatga tcaaggcatt 
caatattatc aacatgtctt ctgtggcttc 
gtacagcaca accaaggcag ccgtgattgg 
ccagcagggc ateaggtgea actgtgtgtg 



aagtttgttg ctcaaaccga gttctggaga 60 
cacaggttcc attatgggtc gacttgatgg 120 
ggggattggc caagcagctg ccttagcttt 180 
agacattaat gagtccaaac ttcaggaact 240 
ccttgatgtc acaaagaaga aacaaattga 3 00 
tgttctcttt aatgttgctg gttttgtcca 360 
agactgggac ttctcgatga atctcaatgt 420 
ccttcctaaa atgettgetc agaaatctgg 480 
cagegtcaaa ggagttgtga acagatgtgt 540 
cctcacaaaa tctgtggctg cagatttcat 600 
cccaggaaca gttgataege catctctaca 660 
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agaaagaata caagccagag 
aaagacggga agattcgcaa 
tgatgaatct gcttatgtaa 
attttaggat ctccatggtg 
tacgaagaaa actcaccaat 
ctttttaatg atgtcactgt 
aatctcttct gaaatcattg 



gaaatcctga 
ctgcagaaga 
ctggtaaccc 
ggaaggaagg 
catctccttc 
ttgcaagagt 
taaagaaata 



agaggcacgg 
aatagccatg 
tgtcatcatt 
caggcccttc 
ctgttaatca 
ctgattcttt 
aaaatattga 



aatgatttcc 
ctctgcgtgt 
gatggaggct 
ctatccacag 
catgttaatg 
aagtatatta 
actcaaaaaa 



tgaagagaca 720 
atttggcttc 780 
ggagcttgtg 840 
tgaacctggt .900 
aaaataagct 960 
atctctttgt 1020 
aa 1072 



<210> 53 

<211> 1040 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



5731013CB1 



<400> 53 

ccttcggctc 

gcttaggctc 

cttggccagc 

cttcagcgtc 

ggtgatcgac 

ctgcgtgccc 

ctgcaagtcg 

ggacgtgatc 

gcaccccttc 

tcctgcccca 

ctttggactg 

atggaaccca 

gtttctttcc 

agtggcatcc 

tttaaatcca 

ctgaagaatc 

tgggctaggc 

ccaaggtcaa 



gagcggcaac 
cgcggcctcc 
gttctccaca 
agccgcgacg 
ttcgcccgac 
agagtagtgg 
gtcgaggaga 
cgcatccgca 
accaacaagc 
gcccaggtgc 
ttactccggt 
tcaatgggca 
agtgcctgct 
ctaaaacctg 
aattgataat 
cctgatgtct 
actttatatc 
aaaaaaaaaa 



cggaagcctc 
aagc tgtagc 
acggactggg 
gcgcctcgtc 
ggaatccagg 
ccgaatacct 
tctcgacgct 
agcccttcca 
cgaccacgtt 
aagcacagtg 
aaaggtggtt 
agttgacaga 
tctgggggca 
agatgaggaa 
aatgatctca 
tattggaaca 
tgtgtgaata 



gaggtttagt 
tatgacggcg 
tcgctatgtg 
tcgcggcgcc 
ggtcgtaata 
taacggggct 
ggtgcagaag 
caccgacaac 
ccgcgggcta 
aagagttgcc 
cttccccttt 
ggttctgctt 
gtgaccttgt 
gacttcaagg 
aaacacagtg 
accactgagc 
cagatttata 



cccgcccccc 
cgcgggactc 
cagcagctgc 
agggagttcg 
tatgtaaact 
gtgcgcgagg 
ctggccgaec 
cctagcatcc 
cgcccccgag 
ccaccaactg 
gggattccaa 
gggataatga 
gaaocactca 
gttttacagg 
agaggtctga 
tacggagagc 
aaacaggtta 



tctcctcgct 
cgagccgctt 
agcgtctgag 
tggagcggga 
cgcgtccgtg 
agagcatcca 
agtcgggctt 
agggccagtg 
aggttcagga 
cagccccagg 
gcccaggcaa 
agagctgcct 
tttttatgca 
acccttgttt 
aggctggctt 
tctgctgtga 
ataaacttat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1040 



<210> 54 
<211> 1624 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No: 



5731162CB1 



<400> 54 

ctcagcctcg 

tatttttagt 

cgtgatccgc 

ggcctgcagg 

tatcttgggc 

atggcggcgc 

tttcccggca 

ctgcctgagc 

attcttgtct 

aatggatgtt 

agcccagggc 

gaaaggcgtt 

aattggccac 

ggtgcagacc 

catctttaag 

gtacacggag 

ggagaacggc 

ggtccgggcc 



cgagcagctg 
agagacgggg 
gcgcctctgc 
tgttgttctg 
ctgaggattg 
ctggactggg 
cctggaggcc 
ccctcgcagc 
gcggaagagt 
cctctccact 
agcacagact 
tttgatgaga 
gctctgcacg 
ttggccagaa 
caacctcact 
cccctgggcc 
tgtctctggt 
cctgttggct 



ggactacagg 
tttcaccgtg 
ctccaaagtg 
accccagctc 
aggcct'agag 
actcagccca 
gcgcttagaa 
tccagaagtt 
gtgtggccat 
gccgcacaga 
atttcttgag 
aaggaaattt 
cccacgaccc 
gtctgggcct 
ttggcggtga 
gggtgctggg 
tcatccctgg 
cagcgcctgg 



tgcccgccac 
ttagccagga 
ctgggattac 
cactgtgcca 
agaggcaggc 
gctgcttggc 
gccgcccagt 
ccaacaggat 
gcaacaaagg 
attctccacc 
cagtggtgac 
,-cctggtcoct 
cgtcttcaag 
ccagatgccc 
agtctcccct 
cgtgtggatc 
ctcccacacc 
tacoagcttc 



cacgcccggc 
tggtctcgat 
aggcccgagc 
gcccgactfcg 
gttcctcaga 
ctgaccctct 
gccctgagcg 
ggattcctgg 
attggcgaga 
caggaagagg 
aagattcgat 
ccggagaaat 
agcatcacac 
gtggtggtgc 
catcaggacg 
gcagtggagg 
agtggtgtgt 
cttgggtcag 



taattttttg 
ttoctgacct 
cgccgcaccc 
agatgccaag 
gtcacaggga 
cacagcataa 
tctccatggc 
tgctggaagg 
tagtggctga 
agcagcttcg 
tcttctttga 
-ccatcaacaa 
actccttcaa 
agagcatgta 
cc tec t tec t 
atgccacgct 
caagaaggat 
agccagcccg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1O80 
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ggataacagc ctctttgtgc ccaccccagt 
agaagtggta cacaagagca agcagaacct 
ccacctcatg gaggcctctg gcaccacctg 
tgaactgccc tttccccaac tgtacaccta 
ctcccgggtg aagctgtggg ctgtaaacac 
agggaggtct tgtctcccct cctgggcttt 
gggtcagggg cttccctaag atcttcacct 
tgaggaggct tctcagccac caaagggttc 
atggaactct ggttattatg gtgttagtta 
aaaa 



gcagagaggg gccctggtcc tcatccatgg 1140 
ctctgaccgc tcgcgccagg cctacacttt 1200 
gagcccggag aactggctcc agccaacagc 1260 
aaggctctcg cagggcagga gccctcgccc 1320 
cagtgccttg ctcagcctcc tggttgcaac 1380 
cctcctgccc tgtgggcagc agcctaggct 1440 
ctctgcctcc ctactgcccc aacatagcct 1500 
tggccccttc tcactctcct ctcctctcag 1560 
tcgaataaaa acgacttcag aatgaaaaaa 1620 

1624 
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"O" document referring to an oral disclosure, use. exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 



T" later document published after the international filing date 
or priority date and not in conflict with the application but 
eked to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
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Box I Observations where certain claims were found unsearchable (Continuati n of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. [yl Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

Although claim 18 is directed to a method of treatment of the human/animal 
body, the search has been carried out and based on the alleged effects of the 
compound/composition. 

2. PH Claims Nos.: 20, 21, 23, 24 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



see additional sheet 



1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers atl 
I * searchable claims. 

2. | | As alt searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
' ' covers only those claims for which fees were paid, specifically claims Nos.: 



4. I y | No required additional search fees were timely paid by the applicant. Consequently, this International Search Report ij 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

1-28 partially 
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j [ The additional search fees were accompanied by the applicant's protest 
| [ No protest accompanied the payment of additional search fees. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-28 (partially) 

An isolated peptide comprising SEQ. ID. NO: 1, a sequence 
90% identical, an active fragment thereof, an imnunogenic 
fragment thereof, an isolated polynucleotide (encoding the 
polypeptide and corresponding to SEQ. ID. NO: 28;" -cells 
transformed with the polynucleotide linked to a promoter, a 
transgenic animal comprising the polynucleotide linked to a 
promoter, an antibody recognizing the polynucleotide^ 
mothod of detecting the polynucleotide by PCr or 
hybridization, a composition containing the polypeptide, 
methods for trating a disease by administering the 
composition, methods for screening agonists and antagonists 
of the polypeptide, compositions and methods linked to the 
alteration of the expression of the polypeptide, methods for 
determining the toxicity of the polypeptide. 

2. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 2 and 29 

3. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 3 and 30 

4. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 4 and 31 

5. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 5 and 32 

6. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 6 and 33 

7. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 7 and 34 

8. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 8 and 35 
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9. 


Claims: 1-28 (partially) 

Same as 1. with reference to SET) 
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Claims: 1-28 (partially) 
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11 and 38 


12. 


Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. 


ID. 


NO: 


12 and 39 


13. 


Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. 


ID. 


NO: 


13 and 40 


14. 


Claims: 1-28 (partially) 
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14 and 41 
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Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. 


ID. 


NO: 


17 and 44 


18. 


Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. 


10. 


NO: 


18 and 45 


19. 


Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. 


ID. 


NO: 


19 and 46 
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20. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 20 and 47 

21. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 20 and 48 

22. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 22 and 49 

23. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 23 and 50 

24. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 24 and 51 

25. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 25 and 52 

26. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 26 and 53 

27. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 27 and 54 



BNSDOC1D: <WO__0144448A3J_> 



page 3 of 3 



International Application No. PCT/US 00 /33158 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



Continuation of Box 1.2 
Claims Nos.: 20, 21, 23, 24 



Present claims 20, 21, 23 and 24 relate to an extremely large number of 
possible compounds (agonists/antagonists)- In fact, the claims contain so 
many options that a lack of clarity within the meaning of Article 6 PCT 
arises to such an extent as to render a meaningful search of the claims 
impossible. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 



